Connecting via Winsock to STN 



LOGINID:sssptl89dxw 
PASSWORD: 

TERMINAL (ENTER 1, 2, 3, OR ?):2 



Welcome to STN International 



NEWS  1         Web Page for STN Seminar Schedule - N. America
NEWS  2  JAN 02 STN pricing information for 2008 now available
NEWS  3  JAN 16 CAS patent coverage enhanced to include exemplified
                prophetic substances
NEWS  4  JAN 28 USPATFULL, USPAT2, and USPATOLD enhanced with new
                custom IPC display formats
NEWS  5  JAN 28 MARPAT searching enhanced
NEWS  6  JAN 28 USGENE now provides USPTO sequence data within 3 days
                of publication
NEWS  7  JAN 28 TOXCENTER enhanced with reloaded MEDLINE segment
NEWS  8  JAN 28 MEDLINE and LMEDLINE reloaded with enhancements
NEWS  9  FEB 08 STN Express, Version 8.3, now available
NEWS 10  FEB 20 PCI now available as a replacement to DPCI
NEWS 11  FEB 25 IFIREF reloaded with enhancements
NEWS 12  FEB 25 IMSPRODUCT reloaded with enhancements
NEWS 13  FEB 29 WPINDEX/WPIDS/WPIX enhanced with ECLA and current
                U.S. National Patent Classification
NEWS 14  MAR 31 IFICDB, IFIPAT, and IFIUDB enhanced with new custom
                IPC display formats
NEWS 15  MAR 31 CAS REGISTRY enhanced with additional experimental
                spectra
NEWS 16  MAR 31 CA/CAplus and CASREACT patent number format for U.S.
                applications updated
NEWS 17  MAR 31 LPCI now available as a replacement to LDPCI
NEWS 18  MAR 31 EMBASE, EMBAL, and LEMBASE reloaded with enhancements
NEWS 19  APR 04 STN AnaVist, Version 1, to be discontinued
NEWS 20  APR 15 WPIDS, WPINDEX, and WPIX enhanced with new
                predefined hit display formats
NEWS 21  APR 28 EMBASE Controlled Term thesaurus enhanced
NEWS 22  APR 28 IMSRESEARCH reloaded with enhancements
NEWS 23  MAY 30 INPAFAMDB now available on STN for patent family
                searching
NEWS 24  MAY 30 DGENE, PCTGEN, and USGENE enhanced with new homology
                sequence search option
NEWS 25  JUN 06 EPFULL enhanced with 260,000 English abstracts
NEWS 26  JUN 06 KOREAPAT updated with 41,000 documents
NEWS 27  JUN 13 USPATFULL and USPAT2 updated with 11-character
                patent numbers for U.S. applications
NEWS 28  JUN 19 CAS REGISTRY includes selected substances from
                web-based collections
NEWS 29  JUN 25 CA/CAplus and USPAT databases updated with IPC
                reclassification data
NEWS 30  JUN 30 AEROSPACE enhanced with more than 1 million U.S.
                patent records
NEWS 31  JUN 30 EMBASE, EMBAL, and LEMBASE updated with additional
                options to display authors and affiliated
                organizations
NEWS 32  JUN 30 STN on the Web enhanced with new STN AnaVist
                Assistant and BLAST plug-in
NEWS 33  JUN 30 STN AnaVist enhanced with database content from EPFULL



NEWS EXPRESS JUNE 27 08 CURRENT WINDOWS VERSION IS V8.3, 

AND CURRENT DISCOVER FILE IS DATED 23 JUNE 2008. 



NEWS HOURS STN Operating Hours Plus Help Desk Availability 
NEWS LOGIN Welcome Banner and News Items 

NEWS IPCS For general information regarding STN implementation of IPC 8 

Enter NEWS followed by the item number or name to see news on that 
specific topic. 

All use of STN is subject to the provisions of the STN Customer 
agreement. Please note that this agreement limits use to scientific 
research. Use for software development or design or implementation 
of commercial gateways or other similar uses is prohibited and may 
result in loss of user privileges and other penalties. 

************** STN Columbus *************** 



FILE 'HOME' ENTERED AT 13:55:21 ON 07 JUL 2008 



=> index bioscience 

FILE 'DRUGMONOG' ACCESS NOT AUTHORIZED 

COST IN U.S. DOLLARS                    SINCE FILE      TOTAL
                                             ENTRY    SESSION
FULL ESTIMATED COST                           0.21       0.21



INDEX 'ADISCTI, ADISINSIGHT, ADISNEWS, AGRICOLA, ANABSTR, ANTE, AQUALINE,
AQUASCI, BIOENG, BIOSIS, BIOTECHABS, BIOTECHDS, BIOTECHNO, CABA, CAPLUS,
CEABA-VTB, CIN, CONFSCI, CROPB, CROPU, DDFB, DDFU, DGENE, DISSABS, DRUGB,
DRUGMONOG2, DRUGU, EMBAL, EMBASE, ...' ENTERED AT 13:55:47 ON 07 JUL 2008



69 FILES IN THE FILE LIST IN STNINDEX 



Enter SET DETAIL ON to see search term postings or to view 
search error messages that display as 0* with SET DETAIL OFF. 

=> s Lactobacillus reuteri and toxin (p) bind? and CD4+ cell? and (food or tablet or 
dietary supplement or confectionery or drug) 
0* FILE ADISCTI 



=> s Lactobacillus reuteri and toxin (p) bind? and CD4+ cells 
0* FILE ADISCTI 



=> s Lactobacillus reuteri and toxin (p) bind?

0* FILE ADISNEWS 

0* FILE ANTE 

0* FILE AQUALINE 

0* FILE BIOENG 

1* FILE BIOTECHABS 

1* FILE BIOTECHDS 

0* FILE BIOTECHNO 

2 FILE CAPLUS 

0* FILE CEABA-VTB 

0* FILE CIN 
27 FILES SEARCHED. . . 

0* FILE ESBIOBASE 

0* FILE FOMAD 



0* FILE FOREGE 

2* FILE FROSTI 

0* FILE FSTA 

5 FILE GENBANK 

1 FILE IFIPAT 

0* FILE KOSMET 

0* FILE NTIS 

0* FILE NUTRACEUT 

1* FILE PASCAL 

0* FILE PHARMAML 

1 FILE SCISEARCH 

1 FILE TOXCENTER 
60 FILES SEARCHED. . . 

21 FILE USPATFULL 
3 FILE USPAT2 
0* FILE WATER 

2 FILE WPIDS 

2 FILE WPINDEX 

13 FILES HAVE ONE OR MORE ANSWERS, 69 FILES SEARCHED IN STNINDEX 
L1 QUE LACTOBACILLUS REUTERI AND TOXIN (P) BIND?

=> file biotechds biotechabs caplus frosti genbank ifipat pascal scisearch 
toxcenter uspatfull uspat2 

COST IN U.S. DOLLARS                    SINCE FILE      TOTAL
                                             ENTRY    SESSION
FULL ESTIMATED COST                           3.90       4.11

FILE 'BIOTECHDS' ENTERED AT 13:59:37 ON 07 JUL 2008 
COPYRIGHT (C) 2008 THOMSON REUTERS 

FILE 'BIOTECHABS' ACCESS NOT AUTHORIZED 

FILE 'CAPLUS' ENTERED AT 13:59:37 ON 07 JUL 2008 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS. 

COPYRIGHT (C) 2008 AMERICAN CHEMICAL SOCIETY (ACS)

FILE 'FROSTI' ENTERED AT 13:59:37 ON 07 JUL 2008 
COPYRIGHT (C) 2008 Leatherhead Food Research Association 

FILE 'GENBANK' ENTERED AT 13:59:37 ON 07 JUL 2008 

FILE 'IFIPAT' ENTERED AT 13:59:37 ON 07 JUL 2008 
COPYRIGHT (C) 2008 IFI CLAIMS (R) Patent Services (IFI) 

FILE 'PASCAL' ENTERED AT 13:59:37 ON 07 JUL 2008 

Any reproduction or dissemination in part or in full, 

by means of any process and on any support whatsoever 

is prohibited without the prior written agreement of INIST-CNRS. 

COPYRIGHT (C) 2008 INIST-CNRS. All rights reserved. 

FILE 'SCISEARCH' ENTERED AT 13:59:37 ON 07 JUL 2008 
Copyright (c) 2008 The Thomson Corporation 

FILE 'TOXCENTER' ENTERED AT 13:59:37 ON 07 JUL 2008 
COPYRIGHT (C) 2008 AMERICAN CHEMICAL SOCIETY (ACS) 



FILE 'USPATFULL' ENTERED AT 13:59:37 ON 07 JUL 2008 

CA INDEXING COPYRIGHT (C) 2008 AMERICAN CHEMICAL SOCIETY (ACS) 



FILE 'USPAT2' ENTERED AT 13:59:37 ON 07 JUL 2008 

CA INDEXING COPYRIGHT (C) 2008 AMERICAN CHEMICAL SOCIETY (ACS) 

=> s l1

PROXIMITY OPERATOR LEVEL NOT CONSISTENT WITH 

FIELD CODE - 'AND' OPERATOR ASSUMED ' TOXIN (P) BIND?' 

PROXIMITY OPERATOR LEVEL NOT CONSISTENT WITH 

FIELD CODE - 'AND' OPERATOR ASSUMED ' TOXIN (P) BIND?' 

PROXIMITY OPERATOR LEVEL NOT CONSISTENT WITH 

FIELD CODE - 'AND' OPERATOR ASSUMED ' TOXIN (P) BIND?' 

L2 38 L1



=> rem dup l2

DUP IS NOT VALID HERE 

The DELETE command is used to remove various items stored by the 
system. 

To delete a saved query, saved answer set, saved L-number list, SDI 
request, batch request, mailing list, or user-defined cluster, format, 
or search field, enter the name. The name may include ? for left, 
right, or simultaneous left and right truncation. 



Examples:

DELETE BIO?/Q           - delete query names starting with BIO
DELETE ?DRUG/A          - delete answer set names ending with DRUG
DELETE ?ELEC?/L         - delete L-number lists containing ELEC
DELETE ANTICOAG/S       - delete SDI request
DELETE ENZYME/B         - delete batch request
DELETE .MYCLUSTER       - delete user-defined cluster
DELETE .MYFORMAT        - delete user-defined display format
DELETE .MYFIELD         - delete user-defined search field
DELETE NAMELIST MYLIST  - delete mailing list



To delete an ordered document or an offline print, enter its
number.



DELETE P123001C - delete print request 

DELETE D134002C - delete document order request 



To delete an individual L-number or range of L-numbers, enter the 
L-number or L-number range. You may also enter DELETE LAST followed 
by a number, n, to delete the last n L-numbers. RENUMBER or 
NORENUMBER may also be explicitly specified to override the value of 
SET RENUMBER. 



DELETE L21              - delete a single L-number
DELETE L3-L6            - delete a range of L-numbers
DELETE LAST 4           - delete the last 4 L-numbers
DELETE L33-             - delete L33 and any higher L-number
DELETE -L55             - delete L55 and any lower L-number
DELETE L2-L6 RENUMBER   - delete a range of L-numbers and
                          renumber remaining L-numbers
DELETE RENUMBER         - renumber L-numbers after deletion of
                          intermediate L-numbers



Entire sets of saved items, SDI requests, batch requests, user-defined 
items, or E-numbers can be deleted. 



Examples:

DELETE SAVED/Q          - delete all saved queries
DELETE SAVED/A          - delete all saved answer sets
DELETE SAVED/L          - delete all saved L-number lists
DELETE SAVED            - delete all saved queries, answer sets,
                          and L-number lists
DELETE SAVED/S          - delete all SDI requests
DELETE SAVED/B          - delete all batch requests
DELETE CLUSTER          - delete all user-defined clusters
DELETE FORMAT           - delete all user-defined display formats
DELETE FIELD            - delete all user-defined search fields
DELETE SELECT           - delete all E-numbers
DELETE HISTORY          - delete all L-numbers and restart the
                          session at L1



To delete an entire multifile SDI request, enter DELETE and 
the name of the request. To delete a component from the 
multifile SDI, enter DELETE and the name of the component. 

=> dup rem l2

DUPLICATE IS NOT AVAILABLE IN 'GENBANK'. 

ANSWERS FROM THESE FILES WILL BE CONSIDERED UNIQUE 

PROCESSING COMPLETED FOR L2 

L3 34 DUP REM L2 (4 DUPLICATES REMOVED)

=> s l3 and (tablet or food or dietary suplement or confectionery or drug) and
product

L4 27 L3 AND (TABLET OR FOOD OR DIETARY SUPLEMENT OR CONFECTIONERY 

OR DRUG) AND PRODUCT 

=> rem dup l4

DUP IS NOT VALID HERE 

The DELETE command is used to remove various items stored by the 
system. 

To delete a saved query, saved answer set, saved L-number list, SDI 
request, batch request, mailing list, or user-defined cluster, format, 
or search field, enter the name. The name may include ? for left, 
right, or simultaneous left and right truncation. 



Examples:

DELETE BIO?/Q           - delete query names starting with BIO
DELETE ?DRUG/A          - delete answer set names ending with DRUG
DELETE ?ELEC?/L         - delete L-number lists containing ELEC
DELETE ANTICOAG/S       - delete SDI request
DELETE ENZYME/B         - delete batch request
DELETE .MYCLUSTER       - delete user-defined cluster
DELETE .MYFORMAT        - delete user-defined display format
DELETE .MYFIELD         - delete user-defined search field
DELETE NAMELIST MYLIST  - delete mailing list



To delete an ordered document or an offline print, enter its
number.

Examples:

DELETE P123001C         - delete print request
DELETE D134002C         - delete document order request



To delete an individual L-number or range of L-numbers, enter the 
L-number or L-number range. You may also enter DELETE LAST followed 
by a number, n, to delete the last n L-numbers. RENUMBER or 
NORENUMBER may also be explicitly specified to override the value of 
SET RENUMBER. 



Examples:

DELETE L21              - delete a single L-number
DELETE L3-L6            - delete a range of L-numbers
DELETE LAST 4           - delete the last 4 L-numbers
DELETE L33-             - delete L33 and any higher L-number
DELETE -L55             - delete L55 and any lower L-number
DELETE L2-L6 RENUMBER   - delete a range of L-numbers and
                          renumber remaining L-numbers
DELETE RENUMBER         - renumber L-numbers after deletion of
                          intermediate L-numbers

Entire sets of saved items, SDI requests, batch requests, user-defined 
items, or E-numbers can be deleted. 



Examples:

DELETE SAVED/Q          - delete all saved queries
DELETE SAVED/A          - delete all saved answer sets
DELETE SAVED/L          - delete all saved L-number lists
DELETE SAVED            - delete all saved queries, answer sets,
                          and L-number lists
DELETE SAVED/S          - delete all SDI requests
DELETE SAVED/B          - delete all batch requests
DELETE CLUSTER          - delete all user-defined clusters
DELETE FORMAT           - delete all user-defined display formats
DELETE FIELD            - delete all user-defined search fields
DELETE SELECT           - delete all E-numbers
DELETE HISTORY          - delete all L-numbers and restart the
                          session at L1



To delete an entire multifile SDI request, enter DELETE and 
the name of the request. To delete a component from the 
multifile SDI, enter DELETE and the name of the component. 

=> dup rem l4

DUPLICATE IS NOT AVAILABLE IN 'GENBANK'. 

ANSWERS FROM THESE FILES WILL BE CONSIDERED UNIQUE 

PROCESSING COMPLETED FOR L4 

L5 27 DUP REM L4 (0 DUPLICATES REMOVED)

=> d l5 1-27

L5 ANSWER 1 OF 27 USPATFULL on STN
AN 2007:296111 USPATFULL
TI Lactobacillus acidophilus nucleic acid sequences encoding cell surface
       protein homologues and uses therefore
IN Klaenhammer, Todd R., Raleigh, NC, UNITED STATES
       Altermann, Eric, Palmerston North, NEW ZEALAND
       Buck, B. Logan, Banner Elk, NC, UNITED STATES
       Russell, W. Michael, Newburgh, IN, UNITED STATES
PA North Carolina State University, Raleigh, NC, UNITED STATES (U.S.
       corporation)
PI US 20070258955 A1 20071108
AI US 2007-701335 A1 20070201 (11)
RLI Division of Ser. No. US 2004-831070, filed on 23 Apr 2004, PENDING
PRAI US 2003-465621P 20030425 (60)
DT Utility
FS APPLICATION
LN.CNT 5104
INCL INCLM: 424/093.400
       INCLS: 435/252.100; 435/252.900; 435/320.100; 435/006.000; 435/007.100;
              435/070.100; 514/002.000; 530/300.000; 530/387.100; 536/023.100
NCL NCLM: 424/093.400
       NCLS: 435/006.000; 435/007.100; 435/070.100; 435/252.100; 435/252.900;
              435/320.100; 514/002.000; 530/300.000; 530/387.100; 536/023.100
IC IPCI A61K0035-00 [I,A]; A61K0038-00 [I,A]; A61P0001-00 [I,A];
       C07H0021-02 [I,A]; C07H0021-00 [I,C*]; C07K0016-00 [I,A];
       C07K0004-00 [I,A]; C12N0001-20 [I,A]; C12N0015-00 [I,A];
       C12P0021-04 [I,A]; C12Q0001-68 [I,A]
   IPCR A61K0035-00 [I,C]; A61K0035-00 [I,A]; A61K0038-00 [I,C];
       A61K0038-00 [I,A]; A61P0001-00 [I,C]; A61P0001-00 [I,A];
       C07H0021-00 [I,C]; C07H0021-02 [I,A]; C07H0021-04 [I,A];
       C07K0004-00 [I,C]; C07K0004-00 [I,A]; C07K0014-195 [I,C*];
       C07K0014-335 [I,A]; C07K0016-00 [I,C]; C07K0016-00 [I,A];
       C12N0001-20 [I,C]; C12N0001-20 [I,A]; C12N0001-21 [I,C*];
       C12N0001-21 [I,A]; C12N0009-00 [I,C*]; C12N0009-00 [I,A];
       C12N0015-00 [I,C]; C12N0015-00 [I,A]; C12P0021-04 [I,C];
       C12P0021-04 [I,A]; C12P0021-06 [I,C*]; C12P0021-06 [I,A];
       C12Q0001-68 [I,C]; C12Q0001-68 [I,A]
CAS INDEXING IS AVAILABLE FOR THIS PATENT.



L5 ANSWER 2 OF 27 USPATFULL on STN
AN 2007:197155 USPATFULL
TI Lactobacillus acidophilus nucleic acid sequences encoding cell surface
       protein homologues and uses therefore
IN Klaenhammer, Todd R., Raleigh, NC, UNITED STATES
       Altermann, Eric, Palmerston North, NEW ZEALAND
       Buck, B. Logan, Banner Elk, NC, UNITED STATES
       Russell, W. Michael, Newburgh, IN, UNITED STATES
PA North Carolina State University, Raleigh, NC, UNITED STATES, 27695-8210
       (U.S. corporation)
PI US 20070172495 A1 20070726
AI US 2007-701319 A1 20070201 (11)
RLI Division of Ser. No. US 2004-831070, filed on 23 Apr 2004, PENDING
PRAI US 2003-465621P 20030425 (60)
DT Utility
FS APPLICATION
LN.CNT 5104
INCL INCLM: 424/234.100
       INCLS: 514/044.000; 435/006.000; 435/007.320; 435/069.100; 435/252.900;
              435/471.000; 530/350.000; 536/023.700
NCL NCLM: 424/234.100
       NCLS: 435/006.000; 435/007.320; 435/069.100; 435/252.900; 435/471.000;
              514/044.000; 530/350.000; 536/023.700
IC IPCI A61K0048-00 [I,A]; A61K0039-02 [I,A]; C12Q0001-68 [I,A];
       G01N0033-554 [I,A]; C07H0021-04 [I,A]; C07H0021-00 [I,C*];
       C12P0021-06 [I,A]; C07K0014-335 [I,A]; C07K0014-195 [I,C*]
   IPCR A61K0048-00 [I,C]; A61K0048-00 [I,A]; A61K0039-02 [I,C];
       A61K0039-02 [I,A]; C07H0021-00 [I,C]; C07H0021-04 [I,A];
       C07K0014-195 [I,C]; C07K0014-335 [I,A]; C12N0001-21 [I,C*];
       C12N0001-21 [I,A]; C12N0009-00 [I,C*]; C12N0009-00 [I,A];
       C12P0021-06 [I,C]; C12P0021-06 [I,A]; C12Q0001-68 [I,C];
       C12Q0001-68 [I,A]; G01N0033-554 [I,C]; G01N0033-554 [I,A]
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L5 ANSWER 3 OF 27 USPATFULL on STN 
AN 2007:140436 USPATFULL 



TI Delivery of trefoil peptides 

IN Hans, Wolfgang Christian, Landshut, GERMANY, FEDERAL REPUBLIC OF 

Steidler, Lothar, Lokeren, BELGIUM 
Remaut, Erik Rene, Lovendegem, BELGIUM 

PI US 20070122427 A1 20070531
AI US 2007-654879 A1 20070118 (11)

RLI Division of Ser. No. US 2002-30390, filed on 16 Apr 2002, PENDING A 371 

of International Ser. No. WO 2000-EP6343, filed on 5 Jul 2000 
PRAI EP 1999-870143 19990705 
DT Utility 
FS APPLICATION 
LN.CNT 1335 

INCL INCLM: 424/200.100 

INCLS: 435/252.300; 435/252.900 
NCL NCLM: 424/200.100 

NCLS: 435/252.300; 435/252.900 
IC IPCI A61K0039-02 [I, A]; C12N0001-21 [I, A] 

IPCR A61K0039-02 [I,C]; A61K0039-02 [I, A]; C12N0001-21 [I,C]; 
C12N0001-21 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 

L5 ANSWER 4 OF 27 USPATFULL on STN 
AN 2007:134502 USPATFULL 

TI Corynebacterium glutamicum genes encoding metabolic pathway proteins 

IN Pompejus, Markus, Freinsheim, GERMANY, FEDERAL REPUBLIC OF 

Kroger, Burkhard, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF
Schroder, Hartwig, Nussloch, GERMANY, FEDERAL REPUBLIC OF
Zelder, Oskar, Speyer, GERMANY, FEDERAL REPUBLIC OF
Haberhauer, Gregor, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF
PA BASF Aktiengesellschaft, Ludwigshafen, GERMANY, FEDERAL REPUBLIC OF
(non-U.S. corporation)
PI US 20070117183 A1 20070524
AI US 2006-511140 A1 20060828 (11)

RLI Division of Ser. No. US 2005-55822, filed on 11 Feb 2005, PENDING 

Continuation of Ser. No. US 2000-606740, filed on 23 Jun 2000, ABANDONED 
PRAI DE 1999-19932125 19990709 

DE 1999-19932227 19990709 

DE 1999-19932228 19990709 

DE 1999-19932230 19990709 

DE 1999-19933005 19990714 

DE 1999-19933006 19990714 

DE 1999-19940764 19990827 

DE 1999-19940766 19990827 

DE 1999-19940832 19990827 

DE 1999-19941378 19990831 

DE 1999-19941379 19990831 

DE 1999-19942077 19990903 

DE 1999-19942079 19990903 

DE 1999-19931418 19990708 

DE 1999-19932126 19990709 

DE 1999-19932229 19990709 

DE 1999-19941396 19990831 

DE 1999-19942087 19990903 

DE 1999-19930476 19990701 

DE 1999-19931419 19990708 

DE 1999-19931420 19990708 

DE 1999-19932206 19990709 

DE 1999-19942088 19990903 

DE 1999-19942124 19990903 

DE 1999-19932928 19990714 

DE 1999-19931415 19990708 

DE 1999-19931424 19990708 



DE 1999-19931428 19990708
DE 1999-19931434 19990708
DE 1999-19931435 19990708
DE 1999-19931443 19990708
DE 1999-19931453 19990708
DE 1999-19931457 19990708
DE 1999-19931465 19990708
DE 1999-19931478 19990708
DE 1999-19931510 19990708
DE 1999-19931541 19990708
DE 1999-19931573 19990708
DE 1999-19931592 19990708
DE 1999-19931632 19990708
DE 1999-19931634 19990708
DE 1999-19931636 19990708
DE 1999-19932130 19990709
DE 1999-19932186 19990709
DE 1999-19932922 19990714
DE 1999-19932926 19990714
DE 1999-19933004 19990714
DE 1999-19940765 19990827
DE 1999-19941380 19990831
DE 1999-19941394 19990831
DE 1999-19942076 19990903
DE 1999-19942086 19990903
DE 1999-19942095 19990903
DE 1999-19942129 19990903
US 1999-141031P 19990625
US 1999-142101P 19990702
US 1999-148613P 19990812
US 2000-187970P 20000309
DT Utility

FS APPLICATION 
LN.CNT 8386 
INCL INCLM: 
INCLS: 



NCL NCLM: 
NCLS: 



435/069.100 
435/106.000; 
536/023.200 
435/069.100 
435/106.000; 
536/023.200 



435/115.000; 435/193.000; 435/252.300; 435/471.000; 
through competitive ELISA. Each reacted as follows, on plates coated with
Gb3, followed by performing ELISA using mAb against VT.
Zelder, Oskar, Speyer, GERMANY, FEDERAL REPUBLIC OF
Haberhauer, Gregor, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF 
PA BASF Aktiengesellschaft, Ludwigshafen, GERMANY, FEDERAL REPUBLIC OF,
-67056 (non-U.S. corporation)



PI US 20050191732 20050901
AI US 2003-721922 20031124 (10)
RLI Continuation of Ser. No. US 2000-603124, filed on 23 Jun 2000, ABANDONED
PRAI DE 1999-19931418 19990708
DE 1999-19932124 19990709
DE 1999-19932126 19990709
DE 1999-19932127 19990709
DE 1999-19932133 19990709
DE 1999-19932207 19990709
DE 1999-19932208 19990709
DE 1999-19932225 19990709
DE 1999-19932229 19990709
DE 1999-19932914 19990709
DE 1999-19933006 19990709
DE 1999-19940765 19990827
DE 1999-19940768 19990827
DE 1999-19940831 19990827
DE 1999-19940832 19990827
DE 1999-19941385 19990831
DE 1999-19941396 19990831
DE 1999-19942087 19990903
US 1999-141031P 19990625
US 1999-143694P 19990714
US 1999-151778P 19990831
DT Utility

FS APPLICATION 
LN.CNT 6552 

INCL INCLM: 435/106.000 

INCLS: 435/006.000; 435/069.100; 435/193.000; 435/252.300; 435/320.100; 
536/023.200 



NCLM: 


435/106.000 


NCLS: 


435/006.000 




536/023.200 


[7] 
I CM 


C12Q001-68 


ICS 


C07H021-04; 


IPCI 


C12Q0001-68 




C12P0013-04 




C12N0001-21 


IPCR 


C07H0021-00 




C07K0014-34 




C12N0009-00 




C12N0009-10 




C12P0001-04 




C12P0013-04 


INDEXING 


IS AVAILABLE 



C12P013-04; C12N009-10; C12N001-21; C12N015-74 
[ICM,7]; C07H0021-04 [ICS, 7]; C07H0021-00 [ICS,7,C*]; 
[ICS, 7]; C12P0013-00 [ICS,7,C*]; C12N0009-10 [ICS,7]; 
[ICS, 7]; C12N0015-74 [ICS, 7] 

C07K0014-195 [I,C*]; 



[I,C*]; C12N0001-21 



[I,A]; C12Q0001-68 [I,C*] 



C12P0013-00 
C12Q0001-68 



I,A] ; 
I,C*] 
I, A] ; 



L5 ANSWER 15 OF 27 USPATFULL on STN
AN 2005:158196 USPATFULL
TI Nucleic acid and amino acid sequences relating to streptococcus
       pneumoniae for diagnostics and therapeutics
IN Doucette-Stamm, Lynn A., Framingham, MA, UNITED STATES
       Bush, David, Somerville, MA, UNITED STATES
PI US 20050136404 20050623
AI US 2003-617320 A1 20030710 (10)
RLI Division of Ser. No. US 1998-107433, filed on 30 Jun 1998, PENDING
PRAI US 1997-51553P 19970702 (60)
       US 1998-85131P 19980512 (60)
DT Utility
FS APPLICATION
LN.CNT 12957
INCL INCLM: 435/006.000
       INCLS: 435/069.100; 435/252.300; 435/320.100; 530/350.000; 536/023.700
NCL NCLM: 435/006.000
       NCLS: 435/069.100; 435/252.300; 435/320.100; 530/350.000; 536/023.700
IC [7]
       ICM C12Q001-68
       ICS C07H021-04; C12N001-21; C07K014-315
       IPCI C12Q0001-68 [ICM,7]; C07H0021-04 [ICS,7]; C07H0021-00 [ICS,7,C*];
       C12N0001-21 [ICS,7]; C07K0014-315 [ICS,7]; C07K0014-195
       [ICS,7,C*]

IPCR C07H0021-00 [I,C*]; C07H0021-02 [I, A]; C07H0021-04 [I, A]; 
C07K0014-195 [I,C*]; C07K0014-315 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L5 ANSWER 16 OF 27 USPATFULL on STN 
AN 2005:131152 USPATFULL 

TI Lactobacillus acidophilus nucleic acid sequences encoding cell surface 

protein homologues and uses therefore 
IN Klaenhammer, Todd R., Raleigh, NC, UNITED STATES 

Alterman, Eric, Apex, NC, UNITED STATES 

Buck, B. Logan, Banner Elk, NC, UNITED STATES 

Russell, W. Michael, Newburg, IN, UNITED STATES 



PI US 20050112612 20050526
       US 7348420 B2 20080325
AI US 2004-831070 A1 20040423 (10)
PRAI US 2003-465621P 20030425 (60)
DT Utility
FS APPLICATION
LN.CNT 6100
INCL INCLM: 435/006.000
       INCLS: 435/069.100; 435/183.000; 435/252.300; 435/320.100; 530/350.000;
              536/023.200
NCL NCLM: 536/023.100; 435/006.000 

NCLS: 435/069.100; 435/252.300; 435/320.100; 435/183.000; 530/350.000; 
536/023.200 

IC [7] 

ICM C12Q001-68 

ICS C07H021-04; C12P021-06; C12N009-00; C12N001-21; C07K014-335 

IPCI C12Q0001-68 [ICM, 7]; C07H0021-04 [ICS, 7]; C07H0021-00 [ICS,7,C*]; 

C12P0021-06 [ICS, 7]; C12N0009-00 [ICS, 7]; C12N0001-21 [ICS, 7]; 

C07K0014-335 [ICS, 7]; C07K0014-195 [ICS,7,C*] 
IPCI-2 C07H0021-04 [I, A]; C07H0021-00 [I,C*]; C12N0001-20 [I, A]; 

C12N0015-00 [I, A]; C12P0001-20 [I, A] 
IPCR C07H0021-00 [I,C]; C07H0021-04 [I, A]; C07K0014-195 [I,C*]; 

C07K0014-335 [I, A]; C12N0001-20 [I,C]; C12N0001-20 [I, A]; 

C12N0001-21 [I,C*]; C12N0001-21 [I, A]; C12N0009-00 [I,C*]; 

C12N0009-00 [I, A]; C12N0015-00 [I,C]; C12N0015-00 [I, A]; 

C12P0021-06 [I,C*]; C12P0021-06 [I, A]; C12Q0001-68 [I,C*]; 

C12Q0001-68 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L5 ANSWER 17 OF 27 USPATFULL on STN 
AN 2004:95339 USPATFULL 

TI Nutritional formulations containing synbiotic substances 

IN Kaup, Susan M., Collingswood, NJ, UNITED STATES 

Wilson, Jeffrey L., Doylestown, PA, UNITED STATES 
Kostek, Beverley M., Glen Mills, PA, UNITED STATES 
Frantz, David C, Pottstown, PA, UNITED STATES 

PA Wyeth, Madison, NJ, UNITED STATES, 07940 (U.S. corporation) 

PI US 20040072794 A1 20040415

AI US 2003-681658 A1 20031008 (10)

PRAI US 2002-418109P 20021011 (60) 

DT Utility 

FS APPLICATION 

LN.CNT 542 

INCL INCLM: 514/054.000 

INCLS: 424/093.450 
NCL NCLM: 514/054.000 

NCLS: 424/093.450 
IC [7] 

ICM A61K045-00 

ICS A61K031-715 

IPCI A61K0045-00 [ICM, 7]; A61K0031-715 [ICS, 7] 

IPCR A23L0001-052 [I,C*]; A23L0001-0528 [I, A]; A23L0001-29 [I, 
A23L0001-29 [I, A]; A23L0001-30 [I,C*]; A23L0001-30 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L5 ANSWER 18 OF 27 USPATFULL on STN
AN 2004:12970 USPATFULL
TI Polynucleotides, materials incorporating them, and methods for using
       them
IN Glenn, Matthew, Whenuapai, NEW ZEALAND
       Havukkala, Ilkka J., Remuera, NEW ZEALAND
       Lubbers, Mark, Palmerston North, NEW ZEALAND
       Dekker, James, Palmerston North, NEW ZEALAND
PA GENESIS RESEARCH AND DEVELOPMENT CORP. LTD., Parnell, NEW ZEALAND
       (non-U.S. corporation)
PI US 20040009490 A1 20040115
       US 7125698 B2 20061024
AI US 2002-264213 A1 20021003 (10)
RLI Continuation-in-part of Ser. No. US 2001-971536, filed on 2 Oct 2001,
       PENDING Continuation-in-part of Ser. No. US 2000-634238, filed on 8 Aug
       2000, GRANTED, Pat. No. US 6544772
PRAI US 1999-147853P 19990809 (60)
       US 1999-147852P 19990809 (60)
       US 1999-152032P 19990901 (60)
       US 1999-152031P 19990901 (60)
DT Utility
FS APPLICATION
LN.CNT 5375
INCL INCLM: 435/006.000
       INCLS: 435/069.100; 435/252.300; 435/320.100; 530/350.000; 536/023.200
NCL NCLM: 435/193.000; 435/006.000
       NCLS: 426/534.000; 435/183.000; 435/194.000; 530/350.000; 435/069.100;
              435/252.300; 435/320.100; 536/023.200
IC [7]
       ICM C12Q001-68
       ICS C07H021-04; C12P021-02; C12N001-21; C12N015-74; C07K014-335
       IPCI C12Q0001-68 [ICM,7]; C07H0021-04 [ICS,7]; C07H0021-00 [ICS,7,C*];
       C12P0021-02 [ICS,7]; C12N0001-21 [ICS,7]; C12N0015-74 [ICS,7];
       C07K0014-335 [ICS,7]; C07K0014-195 [ICS,7,C*]
IPCI-2 C12N0009-10 [I, A] 

IPCR C12N0009-10 [I,C]; C12N0009-10 [I, A]; A23C0019-00 [I,C*]; 

A23C0019-032 [I, A]; A61K0038-00 [N,C*]; A61K0038-00 [N,A]; 
C07K0014-195 [I,C*]; C07K0014-335 [I, A]; C12N0001-21 [I,C*]; 
C12N0001-21 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L5 ANSWER 19 OF 27 USPATFULL on STN 
AN 2004:250212 USPATFULL 

TI Nucleic acid and amino acid sequences relating to Streptococcus 

pneumoniae for diagnostics and therapeutics 
IN Doucette-Stamm, Lynn A., Framingham, MA, United States 

Bush, David, Somerville, MA, United States 
PA Genome Therapeutics Corporation, Waltham, MA, United States (U.S.
       corporation)
PI US 6800744 B1 20041005
AI US 1998-107433 19980630 (9)
PRAI US 1998-85131P 19980512 (60)
       US 1997-51553P 19970702 (60)
DT Utility
FS GRANTED
LN.CNT 11545
INCL INCLM: 536/023.100
       INCLS: 435/006.000; 435/320.100; 435/325.000; 435/254.000; 435/419.000;
              536/024.100; 536/023.400; 536/024.320
NCL NCLM: 536/023.100
       NCLS: 435/006.000; 435/320.100; 435/325.000; 435/419.000; 536/023.400;
              536/024.100; 536/024.320
IC [7]
       ICM C12Q001-68
       ICS C12N001-14; C12N015-00; C12N005-00; C12N005-04; C07H021-02;
       C07H021-04
       IPCI C12Q0001-68 [ICM,7]; C12N0001-14 [ICS,7]; C12N0015-00 [ICS,7];
       C12N0005-00 [ICS,7]; C12N0005-04 [ICS,7]; C07H0021-02 [ICS,7];
       C07H0021-04 [ICS,7]; C07H0021-00 [ICS,7,C*]
       IPCR C07H0021-00 [I,C*]; C07H0021-02 [I,A]; C07H0021-04 [I,A];
       C07K0014-195 [I,C*]; C07K0014-315 [I,A]
EXF 536/23.1; 536/23.4; 536/24.32; 435/320.1; 435/325; 435/254.11; 435/419;
       435/6
CAS INDEXING IS AVAILABLE FOR THIS PATENT.



L5 ANSWER 20 OF 27 USPATFULL on STN 
AN 2003:71519 USPATFULL 

TI Corynebacterium glutamicum genes encoding metabolic pathway proteins 



IN Pompejus, Markus, Freinsheim, GERMANY, FEDERAL REPUBLIC OF
       Kroger, Burkhard, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF
       Schroder, Hartwig, Nussloch, GERMANY, FEDERAL REPUBLIC OF
       Zelder, Oskar, Speyer, GERMANY, FEDERAL REPUBLIC OF
       Haberhauer, Gregor, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF
       Kim, Jun-Won, Seoul, KOREA, REPUBLIC OF
       Lee, Heung-Shick, Seoul, KOREA, REPUBLIC OF
       Hwang, Byung-Joon, Seoul, KOREA, REPUBLIC OF
PI US 20030049804 A1 20030313
AI US 2000-746660 A1 20001222 (9)
RLI Continuation-in-part of Ser. No. US 2000-606740, filed on 23 Jun 2000,
       PENDING Continuation-in-part of Ser. No. US 2000-603124, filed on 23 Jun
       2000, PENDING
PRAI DE 1999-19931420 19990708
       US 1999-141031P 19990625 (60)
       US 1999-142101P 19990702 (60)
       US 1999-148613P 19990812 (60)
       US 2000-187970P 20000309 (60)
DT Utility
FS APPLICATION
LN.CNT 15004
INCL INCLM: 435/115.000
       INCLS: 435/069.100; 435/252.300; 435/320.100; 435/183.000; 536/023.200
NCL NCLM: 435/115.000
       NCLS: 435/069.100; 435/183.000; 435/252.300; 435/320.100; 536/023.200
IC [7]
       ICM C12P013-08
       ICS C07H021-04; C12N009-00; C12P021-02; C12N001-21
       IPCI C12P0013-08 [ICM,7]; C12P0013-00 [ICM,7,C*]; C07H0021-04 [ICS,7];
       C07H0021-00 [ICS,7,C*]; C12N0009-00 [ICS,7]; C12P0021-02 [ICS,7];
       C12N0001-21 [ICS,7]
IPCR C07K0014-195 [I,C*]; C07K0014-34 [I, A]; C12N0001-21 [I,C*]; 
C12N0001-21 [I, A]; C12N0009-00 [I,C*]; C12N0009-00 [I, A]; 
C12N0009-18 [I,C*]; C12N0009-18 [I, A]; C12N0009-90 [I,C*]; 
C12N0009-90 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L5 ANSWER 21 OF 27 USPATFULL on STN 
AN 2003:95966 USPATFULL 

TI Polynucleotides, materials incorporating them, and methods for using 

them 

IN Glenn, Matthew, Auckland, NEW ZEALAND 

Havukkala, Ilkka J., Auckland, NEW ZEALAND 

Bloksberg, Leonard N., Auckland, NEW ZEALAND 

Lubbers, Mark W., Palmer ston North, NEW ZEALAND 

Dekker, James, Palmerston North, NEW ZEALAND 

Christensson, Anna C, Lund, SWEDEN 

Holland, Ross, Palmerston North, NEW ZEALAND 

O'Toole, Paul W., Palmerston North, NEW ZEALAND 

Reid, Julian R., Palmerston North, NEW ZEALAND 

Coolbear, Timothy, Palmerston North, NEW ZEALAND 
PA Genesis Research & Development Corp. Ltd, Parnell, NEW ZEALAND (non-U.S. 
corporation) 
ViaLactia Bioscience (NZ) Ltd., Auckland, NEW ZEALAND (non-U.S. 
corporation) 
PI US 6544772 B1 20030408 
AI US 2000-634238 20000808 (9) 

DT Utility 
FS GRANTED 
LN.CNT 2015 

INCL INCLM: 435/252.300 

INCLS: 435/320.100; 536/023.700 



NCL NCLM: 435/252.300 

NCLS: 435/320.100; 536/023.700 
IC [7] 

ICM C12N001-21 

ICS C12N015-63; C12N015-31 

IPCI C12N0001-21 [ICM, 7]; C12N0015-63 [ICS, 7]; C12N0015-31 [ICS, 7] 
IPCR A23C0019-00 [I,C*]; A23C0019-032 [I, A]; C07K0014-195 [I,C*]; 
C07K0014-335 [I, A]; C12N0001-21 [I,C*]; C12N0001-21 [I,A]; 
C12N0015-31 [I,C*]; C12N0015-31 [I, A] 
EXF 435/252.3; 435/320.1; 536/23.7 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 

L5 ANSWER 22 OF 27 BIOTECHDS COPYRIGHT 2008 THOMSON REUTERS on STN 
AN 1999-00562 BIOTECHDS 

TI Use of lactic and propionic acid bacteria; 

to bind mycotoxin to prevent their absorption or to purify 

contaminated food or feedstuff 
AU El-Nezami H; Kankaanpaa P; Salminen S; Ahokas J 
PA Roy.Melbourne-Inst.Technol. 
LO Melbourne, Victoria, Australia. 
PI WO 9834503 13 Aug 1998 
AI WO 1998-AU63 6 Feb 1998 
PRAI AU 1997-5005 7 Feb 1997 
DT Patent 
LA English 

OS WPI: 1998-557001 [49] 

L5 ANSWER 23 OF 27 GENBANK® COPYRIGHT 2008 on STN 

LOCUS (LOC) : CP000705 GenBank (R) 

GenBank ACC. NO. (GBN) : CP000705 AAOV01000000 AAOV01000001-AAOV01000097 

GenBank VERSION (VER) : CP000705.1 GI:148530277 

SEQUENCE LENGTH (SQL) : 1999618 

MOLECULE TYPE (CI): DNA; circular 

DIVISION CODE (CI): Bacteria 

DATE (DATE) : 17 Oct 2007 

DEFINITION (DEF) : Lactobacillus reuteri F275, 

complete genome. 
SOURCE: Lactobacillus reuteri F275 

ORGANISM (ORGN) : Lactobacillus reuteri F275 

Bacteria; Firmicutes; Lactobacillales; 
Lactobacillaceae; Lactobacillus 
PROJECT (PJID) : GenomeProject : 15766 

COMMENT : 

URL -- http://www.jgi.doe.gov 
JGI Project ID: 4000135 

Source DNA available from Gerald Tannock 
(gerald.tannock@stonebow.otago.ac.nz) 
Bacteria available from DSMZ: DSM 20016 

Contacts: Gerald Tannock (gerald.tannock@stonebow.otago.ac.nz) 
Paul Richardson (microbes@cuba.jgi-psf.org) 

Quality assurance done by JGI-Stanford 
Annotation done by JGI-ORNL and JGI-PGF 
Finishing done by JGI-PGF 

Finished microbial genomes have been curated to close all gaps with 
greater than 98% coverage of at least two independent clones. Each 
base pair has a minimum q (quality) value of 30 and the total error 
rate is less than one per 50000. 
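
As a rough illustration of the quality figures quoted above, the following Python sketch converts between a quality value and a per-base error probability. It assumes that "q" is a Phred-scaled quality score (the record does not say so explicitly); the function names are hypothetical and not part of any JGI or STN tool.

```python
import math


def phred_to_error_probability(q: float) -> float:
    """Probability that a base call is wrong, assuming q is a Phred score."""
    return 10 ** (-q / 10)


def error_probability_to_phred(p: float) -> float:
    """Phred-scaled quality corresponding to an error probability p."""
    return -10 * math.log10(p)


# Minimum per-base quality quoted in the COMMENT field.
p_q30 = phred_to_error_probability(30)

# Quality equivalent of the quoted overall error rate (one per 50000).
q_overall = error_probability_to_phred(1 / 50000)

print(f"q30 error probability: {p_q30:.4f}")
print(f"one error per 50000 bases corresponds to about q{q_overall:.0f}")
```

Under this assumption, q30 is a per-base floor of one expected error in 1000 calls, while the quoted overall rate of less than one per 50000 describes the finished consensus as a whole.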

The JGI and collaborators endorse the principles for the 
distribution and use of large scale sequencing data adopted by the 
larger genome sequencing community and urge users of this data to 
follow them. It is our intention to publish the work of this 
project in a timely fashion and we welcome collaborative 
interaction on the project and analysis. 
(http://www.genome.gov/page.cfm?pageID=10506376). 
REFERENCE: 1 (bases 1 to 1999618) 
AUTHOR (AU): Copeland,A.; Lucas,S.; Lapidus,A.; Barry,K.; 
Detter,J.C.; Glavina del Rio,T.; Hammon,N.; Israni,S.; 
Dalin,E.; Tice,H.; Pitluck,S.; Goltsman,E.; Schmutz,J.; 
Larimer,F.; Land,M.; Hauser,L.; Kyrpides,N.; Kim,E.; 
Walter,J.; Heng,N.C.K.; Tannock,G.W.; Richardson,P. 
TITLE (TI): Complete sequence of chromosome of 
Lactobacillus reuteri DSM 20016 
JOURNAL (SO): Unpublished 
REFERENCE: 2 (bases 1 to 1999618) 
AUTHOR (AU): Copeland,A.; Lucas,S.; Lapidus,A.; Barry,K.; 
Detter,J.C.; Glavina del Rio,T.; Hammon,N.; Israni,S.; 
Dalin,E.; Tice,H.; Pitluck,S.; Goltsman,E.; Schmutz,J.; 
Larimer,F.; Land,M.; Hauser,L.; Kyrpides,N.; Kim,E.; 
Walter,J.; Heng,N.C.K.; Tannock,G.W.; Richardson,P. 
TITLE (TI): Direct Submission 
JOURNAL (SO): Submitted (17-MAY-2007) US DOE Joint Genome Institute 
2800 Mitchell Drive B100, Walnut Creek, CA 94598-1698 
USA 



FEATURES (FEAT) : 
Feature Key 



Qualifier 



1. .1999618 



/organism="Lactobacillus reuteri 
F275" 

/mol-type="genomic DNA" 
/strain="DSM 20016" 
/db-xref="taxon:299033" 
/note="F275 = DSM 20016 = JCM 
1112" 
GENBANK® COPYRIGHT 2008 on STN 



LOCUS (LOC): AM270397 GenBank (R) 
GenBank ACC. NO. (GBN): AM270397 
GenBank VERSION (VER): AM270397.1 GI:134084136 
CAS REGISTRY NO. (RN): 928607-03-8 
SEQUENCE LENGTH (SQL): 163680 
MOLECULE TYPE (CI): DNA; linear 
DIVISION CODE (CI): Plants, fungi, algae 
DATE (DATE): 24 Mar 2007 
DEFINITION (DEF): Aspergillus niger contig An18c0040, complete genome. 
SOURCE: Aspergillus niger 
ORGANISM (ORGN): Aspergillus niger 
Eukaryota; Fungi; Ascomycota; Pezizomycotina; 
Eurotiomycetes; Eurotiales; Trichocomaceae; mitosporic 
Trichocomaceae; Aspergillus 
REFERENCE: 1 (bases 150631 to 153085) 
AUTHOR (AU): Habison,A.; Kubicek,C.P.; Rohr,M. 
TITLE (TI): Partial purification and regulatory properties of 
phosphofructokinase from Aspergillus niger 
JOURNAL (SO): Biochem. J., 209 (3), 669-676 (1983) 
OTHER SOURCE (OS): CA 99:171737 
REFERENCE: 2 (bases 71863 to 73356) 
AUTHOR (AU): Manney,T.R. 
TITLE (TI): Expression of the BAR1 gene in Saccharomyces 
cerevisiae: induction by the alpha mating pheromone of 
an activity associated with a secreted protein 
JOURNAL (SO): J. Bacteriol., 155 (1), 291-301 (1983) 
OTHER SOURCE (OS): CA 99:102121 
REFERENCE: 3 (bases 71863 to 73356) 
AUTHOR (AU): Jones,E.W. 
TITLE (TI): The synthesis and function of proteases in 
Saccharomyces: genetic approaches 
JOURNAL (SO): Annu. Rev. Genet., 18, 233-270 (1984) 
OTHER SOURCE (OS): CA 102:92538 
REFERENCE: 4 (bases 144032 to 145578) 
AUTHOR (AU): Brandsch,R.; Bichler,V. 
TITLE (TI): In vivo and in vitro expression of the 
6-hydroxy-D-nicotine oxidase gene of Arthrobacter 
oxidans, cloned into Escherichia coli, as an 
enzymatically active, covalently flavinylated 
polypeptide 
JOURNAL (SO): FEBS Lett., 192 (2), 204-208 (1985) 
OTHER SOURCE (OS): CA 104:29597 
REFERENCE: 5 (bases 150631 to 153085) 
AUTHOR (AU): Schreferl,G.; Kubicek,C.P.; Rohr,M. 
TITLE (TI): Inhibition of citric acid accumulation by manganese 
ions in Aspergillus niger mutants with reduced citrate 
control of phosphofructokinase 
JOURNAL (SO): J. Bacteriol., 165 (3), 1019-1022 (1986) 
OTHER SOURCE (OS): CA 104:165023 
REFERENCE: 6 (bases 156827 to 158638) 
an enzyme with an unusual pH-dependent activity 
Protein Sci., 10 (5), 911-922 (2001) 

63 (bases 43783 to 45994) 
Ziegler,M.; Oei,S.L. 

A cellular survival switch: poly(ADP-ribosyl)ation 
stimulates DNA repair and silences transcription 
Bioessays, 23 (6), 543-548 (2001) 

64 (bases 43783 to 45994) 
Herceg,Z.; Wang,Z.Q. 

Functions of poly(ADP-ribose) polymerase (PARP) in DNA 
repair, genomic integrity and cell death 
Mutat. Res., 477 (1-2), 97-110 (2001) 
CA 136:145570 



REFERENCE: 65 (bases 15614 to 17392) 
AUTHOR (AU): Mundodi,S.R.; Watson,B.S.; Lopez-Meyer,M.; Paiva,N.L. 
TITLE (TI): Functional expression and subcellular localization of 
the Nectria haematococca Mak1 phytoalexin 
detoxification enzyme in transgenic tobacco 
JOURNAL (SO): Plant Mol. Biol., 46 (4), 421-432 (2001) 
OTHER SOURCE (OS): CA 135:285870 
REFERENCE: 66 (bases 1 to 163680) 
AUTHOR (AU): Pel,H.J.; de Winde,J.H.; Archer,D.B.; Dyer,P.S.; 
Hofmann,G.; Schaap,P.J.; Turner,G.; de Vries,R.P.; 
Albang,R.; Albermann,K.; Andersen,M.R.; Bendtsen,J.D.; 
Benen,J.A.; van den Berg,M.; Breestraat,S.; 
Caddick,M.X.; Contreras,R.; Cornell,M.; Coutinho,P.M.; 
Danchin,E.G.; Debets,A.J.; Dekker,P.; van Dijck,P.W.; 
van Dijk,A.; Dijkhuizen,L.; Driessen,A.J.; d'Enfert,C.; 
Geysens,S.; Goosen,C.; Groot,G.S.; de Groot,P.W.; 
Guillemette,T.; Henrissat,B.; Herweijer,M.; van den 
Hombergh,J.P.; van den Hondel,C.A.; van der 
Heijden,R.T.; van der Kaaij,R.M.; Klis,F.M.; 
Kools,H.J.; Kubicek,C.P.; van Kuyk,P.A.; Lauber,J.; 
Lu,X.; van der Maarel,M.J.; Meulenberg,R.; Menke,H.; 
Mortimer,M.A.; Nielsen,J.; Oliver,S.G.; Olsthoorn,M.; 
Pal,K.; van Peij,N.N.; Ram,A.F.; Rinas,U.; Roubos,J.A.; 
Sagt,C.M.; Schmoll,M.; Sun,J.; Ussery,D.; Varga,J.; 
Vervecken,W.; van de Vondervoort,P.J.; Wedler,H.; 
Wosten,H.A.; Zeng,A.P.; van Ooyen,A.J.; Visser,J.; 
Stam,H. 
TITLE (TI): Genome sequencing and analysis of the versatile cell 
factory Aspergillus niger CBS 513.88 
JOURNAL (SO): Nat. Biotechnol., 25 (2), 221-231 (2007) 
OTHER SOURCE (OS): CA 146:310276 
REFERENCE: 67 (bases 1 to 163680) 
AUTHOR (AU): Pel,H.J. 
TITLE (TI): Direct Submission 
JOURNAL (SO): Submitted (01-MAY-2006) Pel H.J., DSM, 624-0295, P.O. 
Box 1, 2600 MA Delft, THE NETHERLANDS 



FEATURES (FEAT) : 
Feature Key 



exon 
intron 



<41. .>545 
join(<41. .325, 
417. .>545) 

join(41. .325, 417. .545) 



41. .32 
326. .4 



Qualifier 

/organism="Aspergillus niger" 

/mol-type="genomic DNA" 
/db-xref="taxon:5061" 
/clone="An18c0040" 
/locus-tag="An18g00920" 
/locus-tag="An18g00920" 

/locus-tag="An18g00920" 
/codon-start=1 
/product="hypothetical protein" 
/protein-id="CAK47170.1" 
/db-xref="GI:134084137" 
/translation="MRGTRLSLRTPYTAKSHYTT 
TGAQKPLHGSLCEATSFVGCSSVW 
LRGISCSTQYKGKLNIHLNLTSGGRWTPIPAGF 
EHGQGRDLTDELANETLLAFSASP 
AASGCRILFFELDLSVLGSSTSVANAAYLQTRCL 
V" 

/locus-tag="An18g00920" 
/number=1 

/locus-tag="An18g00920" 
/number=1 



gene 
tRNA 



gene 
mRNA 



exon 

intron 

exon 

intron 

intron 
exon 



417..545 
874..954 
874..954 

complement(<1241..>3043) 

complement(join(<1241..1273, 1325..1814, 
1867..2175, 2229..2599, 
2960..>3043)) 
complement(join(1241..1273, 1325..1814, 
1867..2175, 2229..2599, 
2960..3043)) 



/locus-tag="An18g00920" 
/number=2 

/gene="tRNA-Arg (CCG)" 

/locus-tag="An18e00930" 

/gene="tRNA-Arg (CCG)" 

/locus-tag="An18e00930" 

/product="tRNA-Arg" 

/inference="profile:tRNAscan:1 

/note="codon recognized: CGG" 

/locus-tag="An18g00940" 

/locus-tag="An18g00940" 

/locus-tag="An18g00940" 



complement(1241..1273) 
complement(1274..1324) 
complement(1325..1814) 

complement(1815..1866) 
complement(1867..2175) 
complement(2176..2228) 
complement(2229..2599) 



/EC-number="1.1.99.12" 
/inference="profile:COGS:COG2303" 
/inference="profile:PFAM:PF05199" 
/note="unnamed protein product; 
Catalytic activity: L-sorbose + 
acceptor <=> 5-dehydro-D-fructose 
+ reduced acceptor. Cofactor: FAD. 
Title: strong similarity to FAD 
dependent L-sorbose dehydrogenase 
SDH - Gluconobacter oxydans" 
/citation=[34] 
/codon-start=1 
/protein-id="CAK47171.1" 
/db-xref="GI:134084138" 
/translation="MGFVAGFDSDLEWNLRSVPQ 
ERAFGGDTVSHATNSGKPRNYPLR 
EKSRRFHEFLGRKQADDMNSGQQLGYSEAFSSSY 
AGLRSFASSYPLGSNVTLWTKATV 
ERLIMAGRTAEGVAVARAVGDQIERVQVRARKEV 
ILSAGAYGSPKILLLSGIGPAAEL 
KRHNITPVADLPVGKNYADHPHIFTYWTIDECNA 
TLGDGEMETDQVHWLAGLPYDWMS 
FAPADTKTQEIAKKILSPTERSRYLADGKMQLEN 
FIVYTAVDTSARKKPLEGWPGKRV 
ISLMHVLVDSVSRGTLTLRSSDPADAPVLDPKLL 
ASPVDRSALYENVRETARAIQAVE 
GLNAVEFGVDDSLRDNWSNEAIEVRANRSGGSTY 
HYSGTCAMGEVVDTECRVYQAERL 
RWDSGIIPLPLAAHYQAPTYGVAEQAADMILSS 
VA" 

/locus-tag="An18g00940" 
/number=1 

/locus-tag="An18g00940" 
/number=1 

/locus-tag="An18g00940" 
/number=2 

/locus-tag="An18g00940" 
/number=2 

/locus-tag="An18g00940" 
/number=3 

/locus-tag="An18g00940" 
/number=3 

/locus-tag="An18g00940" 
/number=4 

intron complement(2600..2959) /locus-tag="An18g00940" 
/number=4 

exon complement(2960..3043) /locus-tag="An18g00940" 
/number=5 

gene complement(<3788..>4775) /locus-tag="An18g00950" 

mRNA complement(join(<3788..4122, 4201..4523, 4678..>4775)) /locus-tag="An18g00950" 

CDS complement(join(3788..4122, 4201..4523, 4678..4775)) /locus-tag="An18g00950" 

/inference="profile:COGS:COG0454" 
/inference="profile:COGS:COG0456" 
/inference="similar to AA 
sequence:UniProtKB:AE010349.4" 
/note="unnamed protein product; 
Title: weak similarity to 
hypothetical acetyltransferase 
MK0549 - Methanopyrus kandle" 
/codon-start=1 
/protein-id="CAK47172.1" 
/db-xref="GI:134084139" 
/translation="MTKSQFEVREIHTKEEYARL 
VDVLWSANFNPYNPVFTAVHPVSG 
HTAEDRVKDKALDTEIRWAAHEKNPASHLIYVID 
TETGRVAGGCEWLIFHKNPFPNGP 
QPLSCTWYPEGSERAEYASRMLSQAFFPRMCWLQ 
RPHAGVNAMGVHPDYRRRGVGRLL 
MQWGHERIDPLGYESWIEGSPIGRWLYEESGYKR 
VISLNIDFAKKNPSDEWNRLVHEC 
QPPAILLLWRPPRGEWNDKVPPGPWAVTEDTWK" 
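
The CDS location shown for locus An18g00950, complement(join(3788..4122, 4201..4523, 4678..4775)), can be checked segment by segment. The following is a minimal Python sketch, assuming locations are written as `start..end` pairs inside `join(...)` and `complement(...)`; the helper name `segment_lengths` is hypothetical and is not part of STN or GenBank software.

```python
import re
from typing import List


def segment_lengths(location: str) -> List[int]:
    """Return the length in bases of each start..end segment in a location string."""
    cleaned = location.replace(" ", "")  # tolerate stray spaces from the display
    # Optional < or > markers flag partial ends; they do not change the coordinates.
    pairs = re.findall(r"[<>]?(\d+)\.\.[<>]?(\d+)", cleaned)
    return [int(end) - int(start) + 1 for start, end in pairs]


cds_location = "complement(join(3788..4122, 4201..4523, 4678..4775))"
lengths = segment_lengths(cds_location)
total_bases = sum(lengths)

print(lengths)           # length of each coding segment
print(total_bases)       # total coding length in bases
print(total_bases % 3)   # 0 when the coding length is a whole number of codons
```

The three segments are the same intervals listed as exons 1 to 3 of this locus, so the same helper can be used to cross-check the exon and intron rows against the CDS location.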
exon complement(3788..4122) /locus-tag="An18g00950" 
/number=1 

intron complement(4123..4200) /locus-tag="An18g00950" 
/number=1 

exon complement(4201..4523) /locus-tag="An18g00950" 
/number=2 

intron complement(4524..4677) /locus-tag="An18g00950" 
/number=2 

exon complement(4678..4775) /locus-tag="An18g00950" 
/number=3 

gene complement(<5212..>7379) /locus-tag="An18g00960" 

mRNA complement(join(<5212..5456, 5493..5664, /locus-tag="An18g00960" 
5746..6075, 6140..6159, 
6225..6602, 6674..6746, 
6800..6899, 
7249..>7379)) 

CDS complement(join(5212..5456, 5493..5664, /locus-tag="An18g00960" 
5746..6075, 6140..6159, 
6225..6602, 6674..6746, 
6800..6899, 7249..7379)) 

/inference="profile:COGS:COG0833" 
/inference="similar to AA 
sequence:PIR:SI 11 75" 
/note="unnamed protein product; 
Function: Hnm1 of S. cerevisiae is 
the unique choline permease in 
yeast (Km of 0.5 micromolar). 
Regulation: in S. cerevisiae, 
expression of HNM1 gene is 
regulated by the phospholipid 
precursors inositol and choline 
and this regulation involves the 
trans-acting factors Ino2p, Ino4p 
and Opi1p. Remark: alternate name 
for S. cerevisiae Hnm1 is YGL077c. 
Similarity: Hnm1 of S. cerevisiae 
belongs to the APC family, which 
encompasses amino acid permeases 
more related to the mammalian 
cationic amino-acid transporter 
family (Tea/ecoR). Title: strong 
similarity to choline permease 
Hnm1 - Saccharomyces cerevisiae 
plasma membrane" 
/citation=[12] 
/citation=[19] 
/citation=[24] 
/codon-start=1 
/protein-id="CAK47173.1" 
/db-xref="GI:134084140" 

/translation="MARNSDVEHSIELRAPTNRD 

SGTPSKATFSDDAYELARVGKKEV 

GGPAGLVYGFLGVWAGIISVFISMGELASMMPSA 

GGQYHWVSILAPRSARKFLSHVTG 

SVCIIAWTAAPTAAIYLAASVLQSTIAMNIPSYD 

PKGWHITLIMWAILLVCTVLNTWL 

GMILPVIEVLILLVHVLGFFAVLVPLVYLGPKAD 

PRSIFTVSFDYGGWGDLTLATFIG 

LKGTVAAFVGTDGAVHMAEEVANSSRWPRSMLL 

ALMINGATGFAILIAFLFTAGDLL 

KIVESSASYPFMYMLASSTGSKGAAWLSSMMAI 

LQACAGLAGISSGSRMLWSFSREQ 

AIPGWRWVRQVNQRTLVPFHSTLWWAAGLLSL 

INIGSAWLNIILSLVLEAFFASY 

MISLTLLLYRGGVLNWGPFRVKGWLGTANNIFAI 

AYSIIMMFFGCWPPENHPAPKNIN 

YSIVIFAGVTLISIIYYVGWARKHYQGPLAEL" 



exon complement(5212..5456) /locus-tag="An18g00960" 
/number=1 

intron complement(5457..5492) /locus-tag="An18g00960" 
/number=1 

exon complement(5493..5664) /locus-tag="An18g00960" 
/number=2 

intron complement(5665..5745) /locus-tag="An18g00960" 
/number=2 

exon complement(5746..6075) /locus-tag="An18g00960" 
/number=3 

intron complement(6076..6139) /locus-tag="An18g00960" 
/number=3 

exon complement(6140..6159) /locus-tag="An18g00960" 
/number=4 

intron complement(6160..6224) /locus-tag="An18g00960" 
/number=4 

exon complement(6225..6602) /locus-tag="An18g00960" 
/number=5 

intron complement(6603..6673) /locus-tag="An18g00960" 
/number=5 

exon complement(6674..6746) /locus-tag="An18g00960" 
/number=6 

intron complement(6747..6799) /locus-tag="An18g00960" 
/number=6 

exon complement(6800..6899) /locus-tag="An18g00960" 
/number=7 

intron complement(6900..7248) /locus-tag="An18g00960" 
/number=7 

exon complement(7249..7379) /locus-tag="An18g00960" 
/number=8 






gene complement(<7770..>8415) /locus-tag="An18g00970" 

mRNA complement(join(<7770..8282, 8380..>8415)) /locus-tag="An18g00970" 

CDS complement(join(7770..8282, 8380..8415)) /locus-tag="An18g00970" 

exon complement(7770..8282) 

intron complement(8283..8379) 

exon complement(8380..8415) 

<8621..>9700 
join(<8621..8764, 
8830..9156, 9214..9356, 
9406..>9700) 
join(8621..8764, 
8830..9156, 9214..9356, 
9406..9700) 



/note="unnamed protein product; 
Title: weak similarity to 
hypothetical protein BAA96207.1 - 
Oryza sativa" 
/codon-start=1 
/protein-id="CAK47174.1" 
/db-xref="GI:134084141" 
/translation="MLCTHAGGEGARSGRPDDRQ 
GHQPISVALWLCAPPPGFLSGRTS 
LASAGISNPDGATAVNTCETVGCLGLTACGQDCL 
SSRAALAELLLGLCCVGDGGGRAF 
PANPNKMDRYLLLYCRLGLTLDGPIVWWHTLS 
LSCSPPTQLHPPPLLASLHSFFPP 
ESLGRVPISIYAHMRHAAAKIV" 

/locus-tag="An18g00970" 
/number=1 

/locus-tag="An18g00970" 
/number=1 

/locus-tag="An18g00970" 
/number=2 

/locus-tag="An18g00980" 
/locus-tag="An18g00980" 

/locus-tag="An18g00980" 



/note="unnamed protein product; 
Function: M. grisea Pth11 is a 
pathogenicity gene. Function: M. 
grisea Pth11p is likely to be 
involved in host surface 
recognition. Function: M. grisea 
pth11 mutants of strain 4091-5-8 
are nonpathogenic due to a defect 
in appressorium differentiation. 
Localization: in M. grisea, a 
Pth11-green fluorescent protein 
fusion localised to the cell 
membrane and vacuoles. Similarity: 
similarity of the predicted A. 
niger protein and M. grisea Pth11 
is limited to the N-terminal half 
of the protein sequences. Title: 
weak similarity to integral 
membrane protein PTH11 - 
Magnaporthe grisea plasma 



sig-peptide 
mat -peptide 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

gene 

mRNA 

CDS 



sig-peptide 
mat-peptide 



8621. .8698 

join(8699. .8764, 
8830. .9156,9214. .9356, 
9406. .9697) 

8621. .8764 

8765. .8829 

8830. .9156 

9157. .9213 

9214. .9356 

9357. .9405 



membrane" 

/citation=[52] 

/codon-start=l 

/protein-id="CAK47175.1" 

/db-xref="GI:134084142" 

/translation="MTDHSAAVKWTCFLLIVSF 

IAWACLTTNWQVLRRKVSSVALL 

LSTLIASIASGAAVSVAATHGLGQASPLTDAQW 

VMQKALYSMEVLYVLTLGLGKLSV 

MVLFYSLLSSTGQSKSVLAATGLLLIWVWMVIV 

VCLQCHPPEVWNIVGGTCLDLSGI 

WIAFGVMNVLVEIMIIAVPSFIIFRLKLSLKRRL 

WISCFGIRILDIAGSIVQLCYVR 

NFKIHADSPMPTNVWQWAICSQVLQTVAILSACV 

PYLREFLESFPSGMFKPTELKHPT 

VQSAYNATKCSDSDIELMRPESTKDT" 

/locus-tag="An18g00980" 

/inference="protein 
motif:SignalP:2.0" 

/locus-tag="An18g00980" 

/product="unnamed" 
/locus-tag="An18g00980" 
/number=1 

/locus-tag="An18g00980" 
/number=1 

/locus-tag="An18g00980" 
/number=2 

/locus-tag="An18g00980" 
/number=2 

/locus-tag="An18g00980" 
/number=3 

/locus-tag="An18g00980" 
/number=3 

/locus-tag="An18g00980" 
/number=4 
/locus-tag="An18g00990" 



9406..9700 

complement(<9787..>9927) 

complement(<9787..>9927) /locus-tag="An18g00990" 

complement(9787..9927) 

complement(9877..9927) 

complement(9790..9876) 

complement(9787..9927) 

<10500..>11479 
join(<10500..10749, 
10812..11041, 
11105..>11479) 
join(10500..10749, 



/locus-tag="An18g00990" 
/note="unnamed protein product; 
Title: strong similarity to EST 
an-1359 - Aspergillus niger" 
/codon-start=1 
/protein-id="CAK47176.1" 
/db-xref="GI:134084143" 
/translation="MRAATLLLVALSAVALALPT 
TNNVERAAADANVRQMDELTVAAISK" 
/locus-tag="An18g00990" 
/inference="protein 
motif:SignalP:2.0" 
/locus-tag="An18g00990" 
/product="unnamed" 
/locus-tag="An18g00990" 
/number=1 

/locus-tag="An18g01000" 
/locus-tag="An18g01000" 

/locus-tag="An18g01000" 



10812..11041, 
11105..11479) 

exon 10500..10749 

intron 10750..10811 

exon 10812..11041 

intron 11042..11104 

exon 11105..11479 

gene <11980..>13464 

mRNA join(<11980..12137, 12198..>13464) 

CDS join(11980..12137, 12198..13464) 



/note="unnamed protein product; 

Similarity: the nucleotide 

sequence of the ORF shows strong 

similarity to the ESTs an-2192 and 

an-2204 of A. niger. Title: strong 

similarity to EST an-2192 

-Aspergillus niger" 

/codon-start=1 

/protein-id="CAK47177.1" 

/db-xref="GI:134084144" 

/translation="MYSYEKVNRDDDSDQGLLAE 

DSERVAPQPRVSRFPSWTNALLLL 

GLLFSLSLNVGWFVFQKGHNVPIQDLDPDAAMAK 

QRSPYTGLAWDTHKPYSHHSEYTS 

ENATHADEMWESLSMDPMVIAPTWEWAQSKGLSD 

SWAFPWDSNRRIYFIKVFHQLHCL 

KLMRHSYHELWSGQESSIPAPHIEHCLDSLRQDL 

MCKADDTPMPSLQLLNGGGEGQQM 

QCKDFDKLVAWSKAPERNACYKRLTDYKPIVHSI 

ERYAFCPEDSEHYPTMSKYFEEHGHYADPFSE" 

/locus-tag="An18g01000" 

/number=1 

/locus-tag="An18g01000" 
/number=1 

/locus-tag="An18g01000" 
/number=2 

/locus-tag="An18g01000" 
/number=2 

/locus-tag="An18g01000" 
/number=3 

/locus-tag="An18g01010" 
/locus-tag="An18g01010" 

/locus-tag="An18g01010" 

/inference="profile:COGS:COG0477" 
/note="unnamed protein product; 
Function: FLU1 of C. albicans 
facilitates resistance to 
fluconazole and cycloheximide in 
the fluconazole-hypersensitive S. 
cerevisiae strain YKKB-13 lacking 
the ABC (ATP-binding cassette) 
transporter gene PDR5. Function: 
FLU1 of C. albicans facilitates 
resistance to mycophenolic acid in 
C. albicans. Function: FLU1 of C. 
albicans is involved in the 
resistance to azole derivatives in 
C. albicans. Function: MDR1 of C. 
albicans facilitates resistance to 
the anti-mitotic drug benomyl and 
to the dihydrofolate reductase 
inhibitor methotrexate. 
Similarity: N-terminus of the A. 
niger protein is app. 120 aa 
shorter than in FLU1 of C. 
albicans. Similarity: the 
predicted A. niger protein shows 
strong similarity to fluconazole 
resistance protein FLU1 of C. 
albicans, which is a permease 
belonging to the major facilitator 
superfamily. Title: strong 
similarity to fluconazole 
resistance protein FLU1 - Candida 
albicans" 
/citation=[59] 
/codon-start=1 
/protein-id="CAK47178.1" 
/db-xref="GI:134084145" 

/translation="MDEEDGHCKETEVTVQSTSP 

VDSADYDPFDPQTRSTTYKWITW 

LVAGLSTMVQLSTIIAAPVSPSILAHFHSDNALY 

RTLIVSIWELGEIVAPLLWGPLSE 

LYGRQWPLNIANLFFVAFLAGTAASTSIQMLIAF 

RFLSGAATAASAIGPGIVSDLFPE 

ESRGRAMSIMSLTGALGPWGPIIGSYLGEKAGW 

RWAFWLPTIATGTLSLLILWYRE 

TYSVTLQKRKARQQNPESGSKSPEDTDKTASQVF 

FKAILRPLRLLIRSPMLILVTFYL 

SWYGYTYLVMTTIAPLFQDVYGFSEGASGLAFL 

GLCLGLILGAFLCSFLLDRYVRTA 

RARSGTSKPEQRLPPVLIACFVMSGGLFLFGWTA 

QYHVQWIAPIIGTGIIGFGLVSTT 

ITLQTYWDLFGIYAASATSAMLVPRNACAAFLP 

LAGPPLFDRLGYNWGGTLLALIVL 

VFSLMPLIFINYGERLRGKNLLDD" 



exon 11980..12137 /locus-tag="An18g01010" 
/number=1 

intron 12138..12197 /locus-tag="An18g01010" 
/number=1 

exon 12198..13464 /locus-tag="An18g01010" 
/number=2 

gene complement(<14029..>15172) /locus-tag="An18g01020" 

mRNA complement(join(<14029..14318, 14411..14845, /locus-tag="An18g01020" 
14917..>15172)) 

CDS complement(join(14029..14318, 14411..14845, /locus-tag="An18g01020" 
14917..15172)) 
/EC-number="1.13.11.4" 



/inference="profile:COGS:COG3435" 
/inference="similar to AA 
sequence:UniProtKB:AF173167.3" 
/note="unnamed protein product; 
Complex: the estimated molecular 
mass of the purified gentisate 
1,2-dioxygenase of P. alcaligenes 
was 154 kDa, with a subunit mass 
of 39 kDa. Its structure is 
deduced to be a tetramer. Remark: 
gentisate 1,2-dioxygenase of P. 
alcaligenes exhibits typical 
saturation kinetics and has an 
apparent Km of 92 microM for 
gentisate. This enzyme has broad 
substrate specificities towards 
alkyl and halogenated gentisate 
analogs. Similarity: the predicted 
A. niger protein shows strong 
similarity to gentisate 
1,2-dioxygenase of P. alcaligenes 
and conserved hypothetical 
proteins from other procaryotic 
species. Title: strong similarity 
to gentisate 1,2-dioxygenases xlnE 
- Pseudomonas alcaligenes" 
/citation=[50] 
/codon-start=1 
/protein-id="CAK47179.1" 
/db-xref="GI:134084146" 
/translation="MSPSACNTGSNGADSAEQLL 
QDLKKSNTLPLWTQMTRLNPPEPN 
PTAVPFIWRYDSIRPNLLRAGELVTEKQAERRVL 
MLVNPARDAPYTTDTLYAGLQLVM 
PNETAPAHRHTAFAMRYIIEGNGGFTAVHGKRIK 
MQKGDVILTPTWNYHDHGKDGTGP 
MIWLDGLDLPNFRHFPVHFVDHYDQPRYPAEDVD 
SATSPIVFPWDKMKAELDKAPGTW 
AVRRYLRADGSEGGSAERVDAGTSSLPRQETTSA 
VYHVIAGSGCSEIGDKTLVWETGD 
TFCVPSWYKYRHIAAEGETVYLYRFDDKPMISAL 
GFYRSIDTDLATLVSQ" 
complement(14029..14318) /locus-tag="An18g01020" 
/number=1 

complement(14319..14410) /locus-tag="An18g01020" 
/number=1 

complement(14411..14845) /locus-tag="An18g01020" 
/number=2 

complement(14846..14916) /locus-tag="An18g01020" 
/number=2 

complement(14917..15172) /locus-tag="An18g01020" 



<15614..>17392 
join(<15614..15661, 
15723..16281, 
16347..16503, 
16584..16739, 
16803..16942, 
17034..>17392) 
join(15614..15661, 
15723..16281, 
16347..16503, 
16584..16739, 
16803..16942, 
17034..17392) 



/number=3 

/locus-tag="An18g01030" 
/locus-tag="An18g01030" 

/locus-tag="An18 



/inference="profile:COGS:COG0654" 
/inference="profile:PFAM:PF01360" 
/note="unnamed protein product; 
Function: Mak1 from N. 
haematococca specifically 
hydroxylates the phytoalexins 
medicarpin and 
maackiain, converting them to less 
fungitoxic derivatives. 
Localization: heterologous 
expression of the Mak1 cDNA 
construct in plants indicated that 
Mak1 protein accumulates in the 
plant cytoplasm, associated with 
endoplasmic reticulum membranes. 
Similarity: the predicted A. niger 
protein shows strong similarity to 
Mak1 from the fungal pathogen N. 
haematococca. Mak1 belongs to the 
flavin-containing mono-oxygenases. 
Best matches are with putative 
salicylate hydroxylases of several 
procaryotic species. Title: strong 
similarity to maackiain 
detoxification protein 1 MAK1 - 
Nectria haematococca endoplasmatic 
reticulum" 
/citation=[31] 
/citation=[65] 
/codon-start=1 
/protein-id="CAK47180.1" 
/db-xref="GI:134084147" 

/translation="MASTRMSAQSTIADLRVPDY 

PSNCLQFTQQLTASATHTLPQKDA 

SVRLNIILAGAGLGGLATAIALAQAGHAVKIYEQ 

TPVLGEVGAGIQIPSNSTRILFSL 

GLQSYLEPYVTAPESISFRRWQNGKVIGLTKLIP 

NFVNNFKAPYYVIHRADFHSALCQ 

KALDVGVEIELGAKWDYDPIVGSITLADGTKHS 

ADLIVAADGIKSVARNWLQGDEM 

RFQGPGFAAYRAWDVGKMRRDPDLSWILEKPAL 

NIWIGDSRHVMTYTIGAGKAFNMV 

LSHPEMTDPGTWKPETALEDMKAEFQGWDPILSK 

IIGMVEKTVKWPLLTGTLLQNWTV 

GKLVILGDAAHAMVPYMSQGAAMAVEDGIALSRS 

LSHMTSRDQLQKALSIFQEVRKKR 

AGHMQEASLLNGKLWHFPDGSLQQARDEAMAPEV 

QGIPFSHSPNQWSDPATQMWCYGY 

DAEEAIDIAWMESLEARVDCVH" 



exon 15614..15661 /locus-tag="An18g01030" 
/number=1 

intron 15662..15722 /locus-tag="An18g01030" 
/number=1 

exon 15723..16281 /locus-tag="An18g01030" 
/number=2 

intron 16282..16346 /locus-tag="An18g01030" 
/number=2 

exon 16347..16503 /locus-tag="An18g01030" 
/number=3 

intron 16504..16583 /locus-tag="An18g01030" 
/number=3 

exon 16584..16739 /locus-tag="An18g01030" 
/number=4 

intron 16740..16802 /locus-tag="An18g01030" 
/number=4 

exon 16803..16942 /locus-tag="An18g01030" 
/number=5 

intron 16943..17033 /locus-tag="An18g01030" 
/number=5 

exon 17034..17392 /locus-tag="An18g01030" 
/number=6 

gene <17766..>20491 /locus-tag="An18g01040" 

mRNA join(<17766..17864, /locus-tag="An18g01040" 
17939..18039, 
18108..18343, 
18398..19163, 
19373..19935, 
20037..20252, 
20325..>20491) 

CDS join(17766..17864, /locus-tag="An18g01040" 
17939..18039, 
18108..18343, 
18398..19163, 
19373..19935, 
20037..20252, 
20325..20491) 

/inference="profile:PFAM:PF00172" 
/inference="similar to AA 
sequence:UniProtKB:ENU56097.1" 
/note="unnamed protein product; 
Phenotype: facB recessive loss of 
function mutants in A. nidulans 
are deficient in acetate induction 
of acetyl-CoA synthase, isocitrate 
lyase, malate 
synthase, acetamidase, and 
NADP-isocitrate dehydrogenase. 
Similarity: the predicted A. niger 
protein shows similarity to 
acetate regulatory DNA binding 
protein FacB (facB) from A. 
nidulans, which belongs to the 
GAL4-type zinc cluster 
transcriptional activators. Title: 
similarity to acetate regulatory 
DNA binding protein facB - 
Aspergillus nidulans" 
/citation=[37] 
/codon-start=1 
/protein-id="CAK47181.1" 
/db-xref="GI:134084148" 

/translation="MDFAPELPTPQGQSVKRRRV 

ALACDACRTRKSRCDGRRPKCGMC 

EDLGFECAYTPSTTTTNIIVQKDYLRDLEERVKS 

LETSMATVKSDLSGFASQINGSMP 

DAPRKENPLSNQSERFSDFAGTEDTIDAMGAVAF 

ADEEDCGFFGPSSNIAFLRHLSCA 

VAHSASAQKEITSPPLDRVAYDGGFVSATRPSSP 

SHDQRPESLEGEKFDKFALPPPEE 

SLALIHRYFADTGLLFPYIHPPTFFETYAELKDN 

SKRVRRTWLGLLNIILAMAKLTAV 

SGTTSAETGISESAIYYHRALSLCKGEILRGTTL 

EVGRYSIYAPVRARVPVYLRATKI 

NTVQYLLVMGQYLQGTQKSVQAWTIHGLAVKAAL 

QLGLHSKGASRAFTPLEQEVRKRT 

WFGCWLDRVRANLFLLNRKTIQQANIRVSTLYK 

QIANIIDRIYGQNLGCDSPLSIGE 

TVGRVLGIENQLFSWVLGLPESLRQVTVQSMREE 

IERSEVGDEGHHKLYPLKFRIILT 

LRYFHVQILLHRPILVKFLDATGPSGLEADEVKL 

LNDIGYSSMNKCVDSAMGIIDIIH 

ELVSTTGWQKDLLGAWWYSLYYTFNAALVIIGAM 

WVQRNRDTSEDPVDNGGNNNNMDI 

YLSRAVLTLQNLNVGNRMVDRCKYYLDQLITLLG 

LQADRIAETGIISGMNNPVAGNST 









TGFDFPSLGIECGEFMIDDFFPGFARGSGFERW 


exon 17766..17864 /locus-tag="An18g01040"
/number=1
intron 17865..17938 /locus-tag="An18g01040"
/number=1
exon 17939..18039 /locus-tag="An18g01040"
/number=2
intron 18040..18107 /locus-tag="An18g01040"
/number=2
exon 18108..18343 /locus-tag="An18g01040"
/number=3
intron 18344..18397 /locus-tag="An18g01040"
/number=3
exon 18398..19163 /locus-tag="An18g01040"
/number=4
intron 19164..19372 /locus-tag="An18g01040"
/number=4
exon 19373..19935 /locus-tag="An18g01040"
/number=5
intron 19936..20036 /locus-tag="An18g01040"
/number=5
exon 20037..20252 /locus-tag="An18g01040"
/number=6
intron 20253..20324 /locus-tag="An18g01040"
/number=6
exon 20325..20491 /locus-tag="An18g01040"
/number=7
gene complement(<20773..>22555) /locus-tag="An18g01050"
mRNA complement(join(<20773..21444,21501..21638,21710..22119,22180..22303,22370..>22555)) /locus-tag="An18g01050"
CDS complement(join(20773..21444,21501..21638,21710..22119,22180..22303,22370..22555)) /locus-tag="An18g01050"












/EC-number="1.13.11.43"
/inference="profile:COGS:COG3670"
/inference="profile:PFAM:PF03055"
/inference="similar to AA sequence:PIR:JC4324"
/note="unnamed protein product;
Catalytic activity:
lignostilbene-alpha,beta-dioxygenase
isozymes (LSD-I, II, and III)
of P. paucimobilis catalyse the
reaction
1,2-bis(4-hydroxy-3-methoxyphenyl)
ethylene + O(2) = 2 vanillin
Cofactor:
lignostilbene-alpha,beta-dioxygenase
isozymes (LSD-I, II, and III)
of P. paucimobilis require iron as
a cofactor. Complex: LSD-I, II,
and III consist of alpha
alpha, alpha beta, and beta beta
subunits, respectively, they show
different specificities for
stilbene and styrene derivatives.
Function: LSD of P. paucimobilis
catalyses the oxidative cleavage
of the interphenyl double bond in
the synthetic substrate and
lignin-derived stilbenes. it is
responsible for the degradation of
a diarylpropane-type structure in
lignin. Similarity: the predicted
A. niger protein shows strong
similarity to subunit lsdB of
lignostilbene-alpha,beta-dioxygenase
isozyme LSD-III of P.
paucimobilis and related
dioxygenases from several plant
species. Title: strong similarity
to subunit lsdB of
lignostilbene-alpha,beta-dioxygenase
isozyme LSD-III - Pseudomonas
paucimobilis"
/citation=[16]
/citation=[28]
/codon-start=1
/protein-id="CAK47182.1"
/db-xref="GI:134084149"
/translation="MTETRTTAFPARPQFSGFMK
PCRVEGETLNLEVLGDLPTDIDGV
FYRVMPDPQLPPFIENDPWFNGDGNITAFRFHDG
RVSFQQRYVRTEKFVRERQAQRAL
lAQLTLFLWSIEGKYRNKFTDAVEFKVRSTANTN
IVYFNGQLLALKEDSPPYAMDPVT
LETRGLYDFDGQLPSMTFTAHPKFDPETGEMICF
GYEARGDGTPDVCYYTVGPDGKFT
EWWLVAPWAMI HDFAVTDNWVI FP 1 1 PQLC D I
ERMKQGGEHWQWSPETPLYLGVIP
RRGAKTADVKWFQYKNSFPGHTANAHEDSSGNLI
IDLGLSEKNVFFWWPDAQGNAPEP
SSIRSQLVRFTINPRAEDLNLTEPRILQADNSEF
YRIDDRYATKPHRHVFFDMMDPSL
GTDFARIAPVLGGGYPLYNSLARFDNVTGETEVY
FPGKTHMVQEPVFIPRSELAAEGD
GYLLALVNNYDSMSSELHLLDTADFTRARAKILL
PVRLRPGLHGSWVDGSDVPLSAAR L"
complement(20773..21444) /locus-tag="An18g01050"
/number=1
complement(21445..21500) /locus-tag="An18g01050"
/number=1
complement(21501..21638) /locus-tag="An18g01050"
/number=2
complement(21639..21709) /locus-tag="An18g01050"
/number=2
complement(21710..22119) /locus-tag="An18g01050"
/number=3
complement(22120..22179) /locus-tag="An18g01050"
/number=3
complement(22180..22303) /locus-tag="An18g01050"
/number=4
complement(22304..22369) /locus-tag="An18g01050"
/number=4
complement(22370..22555) /locus-tag="An18g01050"

<23096. .>24052 
join(<23096. .23 
23443. .23571, 
23667. .>24052) 
join(23096. .233' 
23443. .23571, 
23667. .24052) 



intron 
exon 



23096. .23375 
23376. .23442 
23443. .23571 



/number=5 

/locus-tag="Anl8g01060" 
375, /locus-tag="Anl8g01060" 



/locus-tag="Anl8g01060" 



/EC-number="5.3.3.10"
/inference="profile:COGS:COG0179"
/inference="profile:PFAM:PF01557"
/note="unnamed protein product;
Catalytic activity:
5-carboxymethyl-2-hydroxymuconate
delta-isomerase from E. coli
catalyses the reaction
5-carboxymethyl-2-hydroxymuconate
= 5-carboxy-2-oxohept-3-enedioate.
Pathway:
5-carboxymethyl-2-hydroxymuconate
delta-isomerase from E. coli is
involved in tyrosine and
phenylalanine metabolism.
Similarity: the predicted A. niger
protein shows strong similarity to
5-carboxymethyl-2-hydroxymuconate
delta-isomerase from E. coli and
many putative
2-hydroxyhepta-2,4-diene-1,
7-dioate isomerases from several
procaryotic and eucaryotic
organisms. Title: strong
similarity to
5-carboxymethyl-2-hydroxymuconate
delta-isomerase - Escherichia coli
cytoplasm"
/citation=[18]
/codon-start=1
/protein-id="CAK47183.1"
/db-xref="GI:134084150"

/translation="MSLPWKRLIRFQASDGRILR 

GEPIIPAAGETDLGLITEVDGLQA 

KLLSGSDIYDVTGKTQVTDEIVTVKRILSPLAQS 

DVP IMRCVGLN YAKH I REAGRKPP 

PFPFIFFKPNTCVHDHGEPWIPRIAQNDQADYE 

VIGKDAKDVPVESALDYIAAYTVG 

NDISSRKLQRDPAYAGWPQWGFSKGFDTFAPLG 

PVLVAASEIPDPSQLQLTTIVDGE 

VRQQESVGDLLFDCRYLISYLSQGTTLQKGSVIM 

TGTPGGKGKFYC" 

/locus-tag="An18g01060"
/number=1
/locus-tag="An18g01060"
/number=1
/locus-tag="An18g01060"



gene 
mRNA 



intron 



exon 



23572. .23666 



<24984. .>27426 
join(<24984. .25058, 



23667. .24052 



/number=2
/locus-tag="An18g01060"
/number=2
/locus-tag="An18g01060"
/number=3
/locus-tag="An18g01070"
/locus-tag="An18g01070"



25166. .25301, 
25564. .25745, 
25812. .26003, 
26080. .26144, 
26206. .26646, 
26722. .27303, 
27423. .>27426) 



CDS join(24984..25058,25166..25301,25564..25745,25812..26003,26080..26144,26206..26646,26722..27303,27423..27426) /locus-tag="An18g01070"



/inference="profile:COGS:COG0477"
/note="unnamed protein product;
Function: FLU1 of C. albicans
facilitates resistance to
fluconazole and cycloheximide in
the fluconazole-hypersensitive S.
cerevisiae strain YKKB-13 lacking
the ABC (ATP-binding cassette)
transporter gene PDR5. Function:
FLU1 of C. albicans facilitates
resistance to mycophenolic acid in
C. albicans. Function: FLU1 of C.
albicans is involved in the
resistance to azol derivatives in
C. albicans. Function: MDR1 of C.
albicans facilitates resistance to
the anti-mitotic drug benomyl and
to the dihydrofolate reductase
inhibitor methotrexate.
Similarity: N-terminus of the A.
niger protein is app. 120 aa
shorter than in FLU1 of C.
albicans. Similarity: the
predicted A. niger protein shows
strong similarity to fluconazole
resistance protein FLU1 of C.
albicans, which is a permease
belonging to the major facilitator
superfamily. Title: similarity to
fluconazole resistance protein
FLU1 - Candida albicans"
/citation=[59]
/codon-start=1
/protein-id="CAK47184.1"
/db-xref="GI:134084151"
/translation="MRGVEELASENKDETRSGRR 
NLGGKQHFRGGGERSVDLALRPPR 
RGTPGSPQSQLAHGVSCSSWLVPRVRIPTDDPGD 
PLRWPSWLKLSVILSTSLVNFVSN 
MGGAGLSVAVPVLMQQLQRSQAEVTQLLTLNFLF 



LG I GN I FWVPL AVKFGKRAS II S S
MLLQAGALVWCAVATGYDSLLGARCVLGFAAAAG
ESIVPEIVADIFFVHERATMMAIY
WLISGGSAVGPLIGGFMVEYASGTWRSYMWLCF
ALASFNIVLLVFLFPESTFKRPEP
SQAQELQLVAEHKETTSSFIENASAAAPEDSYTT
NTPPFKGVLRPFHYNKDASFFKAV
ISPLKLLVHPSVCWGIFTYGISLSPQIIMIFNFS
ALLEASP YFFC SE YVGLME I GAI V
GFLLACYGGGYLSDIVNRMIAQRSSSSSIRPEQR
LISLIPGMAIGPAGCILLAFACGH
KLHWAAIAVGFGMVSFGTVYTPNIALTYIVHLHQ
QDAANCLVS I S LVKNLVAFTFLYV
AVDWVEREGYVQVFMIMFMLNVLALLCAVPLYFL
GRRKDLNVYNGMEGRA"
exon 24984..25058 /locus-tag="An18g01070"
/number=1
intron 25059..25165 /locus-tag="An18g01070"
/number=1
exon 25166..25301 /locus-tag="An18g01070"
/number=2
intron 25302..25563 /locus-tag="An18g01070"
/number=2
exon 25564..25745 /locus-tag="An18g01070"
/number=3
intron 25746..25811 /locus-tag="An18g01070"
/number=3
exon 25812..26003 /locus-tag="An18g01070"
/number=4
intron 26004..26079 /locus-tag="An18g01070"
/number=4
exon 26080..26144 /locus-tag="An18g01070"
/number=5
intron 26145..26205 /locus-tag="An18g01070"
/number=5
exon 26206..26646 /locus-tag="An18g01070"
/number=6
intron 26647..26721 /locus-tag="An18g01070"
/number=6
exon 26722..27303 /locus-tag="An18g01070"
/number=7
intron 27304..27422 /locus-tag="An18g01070"
/number=7
exon 27423..27426 /locus-tag="An18g01070"
/number=8
gene complement(<27797..>28667) /locus-tag="An18g01080"
mRNA complement(join(<27797..28220,28259..28342,28450..28550,28602..>28667)) /locus-tag="An18g01080"
complement(join(27797..28220,28259..28342,28450..28550,28602..28667))



/locus-tag="Anl8 



/codon-start=1
/product="hypothetical protein"
/protein-id="CAK47185.1"
/db-xref="GI:134084152"
/translation="MGPRKAFLNGYIVWRRIQP 
WRALAAQAVPLIHTSNADSSPATG 
FYSLRFWSHPHPPTVPANHTVSGTNTSFDRHVI 



LVSNNSQWIINPWSIGRTLSWPGP
VKSLEAGRYGHLRKHNPAAYEELNFASSNKHCCC
VMLRAYYYPRVQEEYLLRLFLYRC
LKDTASFATVRKKDQSLNVHESLQSTYPGGIFRV
AAKADFPAEGSKSFRTQTMCWALL LKYGVF"
sig-peptide complement(join(28506..28550,28602..28667)) /locus-tag="An18g01080"
/inference="protein motif:SignalP:2.0"
complement(join(27800..28220,28259..28342,28450..28505)) /locus-tag="An18g01080"



/product="hypothetical proteii
complement(27797..28220) /locus-tag="An18g01080"
/number=1
complement(28221..28258) /locus-tag="An18g01080"
/number=1
complement(28259..28342) /locus-tag="An18g01080"
/number=2
complement(28343..28449) /locus-tag="An18g01080"
/number=2
complement(28450..28550) /locus-tag="An18g01080"
/number=3
complement(28551..28601) /locus-tag="An18g01080"
/number=3
complement(28602..28667) /locus-tag="An18g01080"
/number=4
<28874..>30837 /locus-tag="An18g01090"
join(<28874..29683,29736..29875,29925..>30837) /locus-tag="An18g01090"
join(28874..29683,29736..29875,29925..30837)



/locus-tag="Anl8 



/EC-number="3.1.1.5"
/inference="profile:PFAM:PF01735"
/note="unnamed protein product;
Catalytic activity:
lysophospholipase catalyses the
reaction 2-lysophosphatidylcholine
+ H(2)O = glycerophosphocholine +
a fatty acid anion. Pathway:
lysophospholipase is involved in
phospholipid degradation.
Similarity: the predicted A. niger
protein shows strong similarity to
the protein sequences
lysophospholipase (sequence 5 and
sequence 7) of patents
WO0127251-A/5 and WO0127251-A/7
from A. oryzae (AC# AX112082 and
AC# AX112084) and to other fungal
lysophospholipases. Title: strong
similarity to lysophospholipase
from patent WO0127251-A -



Aspergillus oryzae
extracellular/secretion proteins"
/citation=[14]
/codon-start=1
/protein-id="CAK47186.1"
/db-xref="GI:134084153"
/translation="MLSLLISAAAATLASALELP
QGYSPDPVSCPTNLSWIRPAVGLS
RDEAQWVEGRKNVILGSLDAYLKRLNLDDFDTDE
YISRLNNTSQTPIMGMAISGGGFG
SAYTGTGLIRALDDRLPAANEQRTGGLLQSMTYL
SGLSGGSWPAVSFPSYNFPTADEI
VDYWKPEIDRFFTVTNTSAEAATGKAIFEQIATK
YLAGFEVALSDYLGRGFAYEFIPG
QSGGLNTTFSGIRNLSNFINHQMPMPIIHLASVE
PEDAEYYDLLVPSSNGTIFDLTPF
EFGAWDGDVHAFTPTEWLGNQLSNGIPVNQSKCW
KGFDRS S LVI GT S ADAFNFWYLE S
VSNGTLGQFAKRSTTHESSLTKRLSQPANLNALV
DAFQETFDLNLTQISYSKFPNPFT
NLSLSTGNTHKSSTLNLVDGSETGQTIPLWGQIQ
PARNVDFI lAWDDSQDADPYSWNN
GTNLYNTYLAANATGLPFPIIPPSRTMMNLNYTL
HPQFFGCDANLTTTGDDRAPIVLY
MANAPYSAYTNFSFWQTETSRQQMGEIFVNSFDI
VTQANGSvfDGEKAECMGCAAVERS
LARVGMERTRQCQRCFERYCWDGTLDERDPGVLD
PTLVLDPGVKFGLWNATNP Y "
sig-peptide 28874..28921 /locus-tag="An18g01090"
/inference="protein motif:SignalP:2.0"
mat-peptide join(28922..29683,29736..29875,29925..30834) /locus-tag="An18g01090"
/product="unnamed"
exon 28874..29683 /locus-tag="An18g01090"
/number=1
intron 29684..29735 /locus-tag="An18g01090"
/number=1
exon 29736..29875 /locus-tag="An18g01090"
/number=2
intron 29876..29924 /locus-tag="An18g01090"
/number=2
exon 29925..30837 /locus-tag="An18g01090"
/number=3
gene complement(<30960..>32218) /locus-tag="An18g01100"
mRNA complement(join(<30960..31045,31081..31220,31273..31630,31702..>32218)) /locus-tag="An18g01100"
complement(join(30960..31045,31081..31220,31273..31630,31702..32218)) /locus-tag="An18g01100"



/EC-number="1.1.1.-"
/inference="profile:COGS:COG0604"
/inference="profile:COGS:COG1064"
/inference="profile:PFAM:PF00107"
/note="unnamed protein product;
Remark: the patent does not
provide further information about
the function of the protein.
Similarity: the predicted A. niger
protein shows strong similarity to
the protein sequence alcohol
dehydrogenase (sequence 19) of
patent EP0845532-A/19 from an
unclassified organism (AC# A92108)
and to other zinc-containing
dehydrogenases. Title: strong
similarity to alcohol
dehydrogenase from patent
EP0845532-A - Unclassified
organism cytoplasm"
/codon-start=1
/protein-id="CAK47187.1"
/db-xref="GI:134084154"
/translation="MSHAQITHRRAVRRTDDYTP
HTPKVNIVTEPLPLPLSPTAVLIR
VHAVALNYRDANIANGGNPWPVLPHGILGNDAAG
EVI AVGEKVKT LAVGDRAAP I TDT
EQITERATKRSWLAADEDGVMADYIVYDESVLCK
LPAYLDWVQASMIPCAGVTAWSAL
KGMRIGQTVLVQGTGGVSMFALKLALAAGLRWL
T S S S DAKLQAI KEKYAGS AI LTVN
YKNTPEWDEEVLKLTGGVGVDLWENGGTGSLVK
SMRCTRRGGWSQVGYLGKQDPSE
LSELVPTLIDRRINLRGINAGSKLDMEDFCEALS
ASQTLLDDLIDKVFPFEQAEEAVE
YVWQEKDIGSERKIQNSFYHLGSYVGTWSSAS "
exon complement(30960..31045) /locus-tag="An18g01100"
/number=1
intron complement(31046..31080) /locus-tag="An18g01100"
/number=1
exon complement(31081..31220) /locus-tag="An18g01100"
/number=2
intron complement(31221..31272) /locus-tag="An18g01100"
/number=2
exon complement(31273..31630) /locus-tag="An18g01100"
/number=3
intron complement(31631..31701) /locus-tag="An18g01100"
/number=3
exon complement(31702..32218) /locus-tag="An18g01100"
/number=4
gene <32976..>33236 /locus-tag="An18g01110"
mRNA <32976..>33236 /locus-tag="An18g01110"
CDS 32976..33236 /locus-tag="An18g01110"
/codon-start=1
/product="hypothetical protein"
/protein-id="CAK47188.1"
/db-xref="GI:134084155"
/translation="MADQPPMGRDFIDAKNVDPV
DHNDQTDPEDSASQPRFPGPGSST
TAPSTSKPHSNKLLNKLDPRYDSDVLESEKQENE
KAQRRGWD"
exon 32976..33236 /locus-tag="An18g01110"
/number=1

gene complement(<33419..>35477) /locus-tag="An18g01120"
mRNA complement(join(<33419..33465,33522..34023,34080..34956,35015..35044,35095..>35477)) /locus-tag="An18g01120"
CDS complement(join(33419..33465,33522..34023,34080..34956,35015..35044,35095..35477)) /locus-tag="An18g01120"

/EC-number="l 

/inference="profile:COGS:COG2303"
/inference="profile:PFAM:PF00732"
/inference="profile:PFAM:PF05199"
/note="unnamed protein product;
Catalytic activity: versicolorin B
synthase from A. parasiticus
catalyses the side chain
cyclization of racemic versiconal
hemiacetal to the bisfuran ring
system of (-)-versicolorin B.
Pathway: versicolorin B synthase
from A. parasiticus is involved in
the aflatoxin biosynthetic
pathway. Remark: a splice site was
detected upstream of the START
codon. Remark: aflatoxins comprise
a group of polyketide-derived
carcinogenic mycotoxins. Remark:
the genes encoding the aflatoxin
biosynthetic enzymes in A.
parasiticus are clustered. Remark:
versicolorin B synthase from A.
parasiticus possesses an
amino-terminal sequence homologous
to the ADP-binding region of other
flavoenzymes, but does not require
flavin or nicotinamide cofactors
for its cyclase activity.
Similarity: the predicted A. niger
protein shows strong similarity to
versicolorin B synthase from A.
parasiticus and other
oxidoreductases. Title: strong
similarity to versicolorin B
synthase vbs - Aspergillus
parasiticus"
/citation=[30]
/codon-start=1
/protein-id="CAK47189.1"
/db-xref="GI:134084156"
/translation="MIRHVLLSLALQCMVTGTIA 
NPDTLSHNYDERGLFGDSEYDYW 
VGGGTAGNTIATRLAQKNYKVAVIEAGGSYELES 
VAEVPAADVLPVGSDPETAAAHDW 
GFVAENVPGANGRSIHYARGKCWGGSSAMNFMIY 
QRPTIESMEQWATAVNDSSYTFDQ 
TLPFYKNSVKFTPPNTQIRAKNATAGYDPSAYES 
TGGPLKVSYANYAMPFSTWMDLGM 



KAIGINETQDFNLGSLMGGQYCASTIDPNGEVRS
SSEESFLANKPSTLTTYANTLAKK
I I FNNQKQATGVQVKGS AGN I YT I KANREVI VS A
GAFQSPQLLMVSGVGPQDQLEEHG
IQWANRPGVGQNMWDHPFFAPSYRVNVQTFTAI
ANDFLGI VGQF INMVGFGNGPLTN
PISDYLAWEKIPAALRSAFSSQTTKQLATFPSDW
PEAEYISGAGYMGNVSNLLINQPE
DGYQYASMLAVLITPTSRGNITLRSADTDDLPVI
NPNWL ATQ S DQE VA I AMF KRVRAA
FQSKAMAPVIIGNEYNPGLEVQSDEQILQWIKDN
VMTLWHAACTCKMGTSDDEMAWD
SQARVYGVQGVRWDASAFPFLPPGHPQSTVYML
AEKIANEIINGA"
complement(35418..35477) /locus-tag="An18g01120"
/inference="protein motif:SignalP:2.0"
mat-peptide complement(join(33422..33465,33522..34023,34080..34956,35015..35044,35095..35417)) /locus-tag="An18g01120"
/product="unnamed"
complement(33419..33465) /locus-tag="An18g01120"
/number=1
complement(33466..33521) /locus-tag="An18g01120"
/number=1
complement(33522..34023) /locus-tag="An18g01120"
/number=2
complement(34024..34079) /locus-tag="An18g01120"
/number=2
complement(34080..34956) /locus-tag="An18g01120"
/number=3
complement(34957..35014) /locus-tag="An18g01120"
/number=3
complement(35015..35044) /locus-tag="An18g01120"
/number=4
complement(35045..35094) /locus-tag="An18g01120"
/number=4
complement(35095..35477) /locus-tag="An18g01120"
/number=5
gene <37406..>38455 /locus-tag="An18g01130"
mRNA <37406..>38455 /locus-tag="An18g01130"
CDS 37406..38455 /locus-tag="An18g01130"
/inference="profile:COGS:COG3000"
/note="unnamed protein product;
Remark: the patent does not
provide further information about
the function of the protein.
Similarity: the predicted A. niger
protein shows similarity to the
protein fragment (SEQ ID NO:



gene 
mRNA 
CDS 



37406. .38455 

<38805. .>39836 
<38805. .>39836 
38805. .39836 



mat -peptide 
exon 



38805. .38876 

38877. .39833 

38805. .39836 

<40728. .>42671 
join(<40728. .41152, 



51484) of patent EP1033405-A2 from
A. thaliana (AC# AAG41385) and to
putative sterol desaturase family
proteins. Title: similarity to
protein fragment SEQ ID NO: 51484
from patent EP1033405-A2 -
Arabidopsis thaliana"
/codon-start=1
/protein-id="CAK47190.1"
/db-xref="GI:134084157"

/translation="MSGTHPNPKDSMKSTWRRLD 

RAEWT I YHWF YE I LGVHPEHLDKE 

VP VHQKTEK I P YMRT WS QHVW I L YHAF I P L AVHH 

VYVS YTGQNFTPVGAFFFYS I AFK 

LIAIHQLQVMRRMGHVLGFLDGDQHGRDGVPDVG 

VAKWRSLISTSTFRPIMTVFLSY 

RVSQAPAQMSWGWLPLEIGLYGIILDFWFYWYHR 

LMHDVGSLWKYHRTHHLTKHPNPL 

LTLYADTEQEFFDIAGIPLMTYFSMRLMGMPMGF 

YEWWICHQYWFTELAGHSGLRMH 

ASPPSTLDWLLRLIGAELVIEDHDLHHRKGWKKS 

HNYGKQTRLWDKIFGTCHERIESR 

EGNVDYDNTVRMPIF" 

/locus-tag="An18g01130"
/number=1
/locus-tag="An18g01140"
/locus-tag="An18g01140"
/locus-tag="An18g01140"

/EC-number="1.1.1.169"
/inference="profile:COGS:COG0240"
/inference="profile:PFAM:PF02558"
/note="unnamed protein product;
Title: similarity to ketopantoate
reductase - Salmonella typhimurium
cytoplasm"
/citation=[23]
/citation=[42]
/codon-start=1
/protein-id="CAK47191.1"
/db-xref="GI:134084158"

/translation="MTSQRIHLIGLGSIGTLIAH 

SLRSLPAPPPITILLHRESAYQDF 

LSGGSKLRLQIGEDGPTQETTGYDAELLQGESPI 

QHLIVAVKASATIPALRSIQHRLG 

RHSVICLFQNGLGQIEELNERLFTDPLTRPTYMF 

G I MRHGVY I KAP FEAVLAGLTGSC 

ALGI VNNDQQAS S L S AS S TAS AQSRFL INRLVEA 

PIVRSSELPWMELHQAQLLKLATN 

CWNPLTALLDVRNGSLLANSELQEMRRQLLQEI 

SLVFRRLPEFQGLPDVEEQFSVAR 

LEEQLIGNIEKTASNSSSMREDVRAGRATEIAYI 

NGWIVRRGQQLGLDCPTNRCLTQL 

ILAKSSQGR" 

/locus-tag="An18g01140"
/inference="protein motif:SignalP:2.0"
/locus-tag="An18g01140"
/product="unnamed"
/locus-tag="An18g01140"
/number=1
/locus-tag="An18g01150"
/locus-tag="An18g01150"
/locus-tag="An18g01150"



/inference="profile:COGS:COG0477"
/inference="similar to AA sequence:UniProtKB:AF188621.1"
/note="unnamed protein product;
Function: FLU1 of C. albicans
facilitates resistance to
fluconazole and cycloheximide in
the fluconazole-hypersensitive S.
cerevisiae strain YKKB-13 lacking
the ABC (ATP-binding cassette)
transporter gene PDR5. Function:
FLU1 of C. albicans facilitates
resistance to mycophenolic acid in
C. albicans. Function: FLU1 of C.
albicans is involved in the
resistance to azol derivatives in
C. albicans. Function: MDR1 of C.
albicans facilitates resistance to
the anti-mitotic drug benomyl and
to the dihydrofolate reductase
inhibitor methotrexate.
Similarity: the predicted A. niger
protein shows strong similarity to
fluconazole resistance protein
FLU1 of C. albicans, which is a
permease belonging to the major
facilitator superfamily. Title:
strong similarity to fluconazole
resistance protein FLU1 - Candida
albicans"
/citation=[59]
/codon-start=1
/protein-id="CAK47192.1"
/db-xref="GI:134084159"
/translation="MSPIDEQLEMAEVDARPHTN 
GASKEEGLERPSSSGTTGSTGYSS 
SLEYNPQLHTLHSRNTELDLERHRTATSHALSRI 
ETQRLQHQLTVGESVKSRASRTPL 
PRMGANKDYPPPLPDREDYWEFDGPEDPLYPQN 
WELGTKIYISAILAFTSICSTFDS 
AIFSSSTTNVARVFGVGIEVATLSSSLYICGYAS 
GP L VWAP L S E L KGRKP P I WAML G 
FGIFNTAVAVAKDLQTIMICRFFCGVFGSCPLAV 
VAAVF S D I YNNRTRGVAI AMF S S T 
VFLGPLLAPFIGGFINTSYLGWRWTAYIPAFMGY 
AAFVLNMFFLKESYPPIILVSKAS 
ELRRRTKNWGIHAKQEEIEVDLRELLVNNFSRPL 
RLLFGEPLILAVTIYLSFIYGLLY 
CFLTAYTLVFQGVYGMSAGVGGLTLFGMWGLFI 
AASYIIFASRGYNKKLEANGGIPV 
PEWRLPPVIIGGALFAGGLFWFGWTGFTKSIHWI 
VPTLSGLFTGLGLLIIFIQLFNYL 
IDTYLMFAASAIAANTFCRSLVAASFPLFSRQMF 
NNMG I QWAAT L LGC VAS I L VP I P V 
AFYFYGKNLRKRSKFAPFYDKVAESHAEDPVENG 
VAEH" 

/locus-tag="An18g01150"
/number=1
intron 41153..41225 /locus-tag="An18g01150"
/number=1
41226..42344 /locus-tag="An18g01150"
/number=2
intron 42345..42400 /locus-tag="An18g01150"
/number=2
exon 42401..42671 /locus-tag="An18g01150"
/number=3
gene 42950..43033 /gene="tRNA-His (GTG)"
/locus-tag="An18e01160"
tRNA 42950..43033 /gene="tRNA-His (GTG)"
/locus-tag="An18e01160"
/product="tRNA-His"
/inference="profile:tRNAscan:1.4"
/note="codon recognized: CAC"
<43783..>45994 /locus-tag="An18g01170"
join(<43783..43852,43912..44025,44075..44344,44395..45240,45294..45376,45425..>45994) /locus-tag="An18g01170"
join(43783..43852,43912..44025,44075..44344,44395..45240,45294..45376,45425..45994) /locus-tag="An18g01170"



/EC-number="2.4.2.30"
/inference="profile:PFAM:PF00644"
/inference="profile:PFAM:PF02877"
/note="unnamed protein product;
Catalytic activity: the NAP
protein of Z. mays catalyses the
reaction NAD(+) +
{ADP-D-ribosyl}(N)-acceptor <=>
nicotinamide +
{ADP-D-ribosyl}(N+1)-acceptor.
Function: the NAP protein of Z.
mays is involved in programmed
cell death or apoptosis.
Localization: the NAP protein of
Z. mays is a nuclear enzyme.
Remark: the ADP-D-ribosyl group of
NAD(+) is transferred to an
acceptor carboxyl group on a
histone or the enzyme itself, and
further ADP-ribosyl groups are
transferred to the 2'-position of
the terminal adenosine moiety,
building up a polymer with an
average chain length of 20-30
units. Remark: the NAP
polynucleotide sequences can be
used for modulation of programmed
cell death in eukaryotic cells,
the method is used, specifically
in plants, to induce, or protect
against, programmed cell
death, depending on the extent to
which PARP activity is reduced.
reducing expression of endogenous
NAP class PARP only is also used
to modulate programmed cell death,
to increase growth rate and to
produce plant cells that are more
tolerant of stress (cold, chemical
treatments, pathogens, pests,
drought, heat, etc., or during
transformation). particular
applications are generation of
plants that are resistant to fungi
or nematodes; are male or female
sterile; or have better
seed-shatter properties, the
methods are also used to improve
growth of transformed plant cells
(and derived calli or complete
plants). Similarity: the predicted
A. niger protein shows strong
similarity to the protein
sequence poly(ADP-ribose)
polymerase NAP protein of patent
WO200004173-A1 from Z. mays (AC#
AAY68834) and from many other
eucaryotic organisms. Title:
strong similarity to
poly(ADP-ribose) polymerase NAP
protein from patent WO200004173-A1
- Zea mays nucleus"
/citation=[61]
/citation=[63]
/citation=[64]
/codon-start=1
/protein-id="CAK47193.1"
/db-xref="GI:134084160"
/translation="MTGQAFKGLVIAVAGSFSGG 
YKQANLKTIIQRHGGTFSSAVTED 
CTHLVTTQREVDNKSVKYTQARKVYTCNIVSLDW 
L VE S D S AGKKL DEKKF LMG S D I KK 
DDEPESPKKRTLEQALGINEDGTTKKLKDAQTVG 
TKQINVPVDDTCPLRLTFTVYIDP 
TGLIWDATLNQTSATNNNNKFYRIQLLHRNNEFR 
TWTHWGRVGEHGQHALLGGGGLDE 
AEYEFKKKFKDKSGLTWENRLDPPKKGKYTF I EK 
NYEEDTEDEDEDEDKWAKKPTKP 
KAEEVKCTLSAPVQDLVSFIFNKDFFQSTMASMS 
YDAQKLPLGKLSKRTLQNGFQALK 
DLSELIANPALASTKYDTSFTAAVEHLSNLYFTV 
IPHAFGRNRPPVLNNDNLLKREIE 
LLEALTDMEVANSIMKDARNTDTVHPLDRQFQGL 
NMQEMTPLEHTSTEFIELANYLNQ 
SRGHTHGVQYKVINIFRIERQGEKDRFQSSMYSN 
IQNSCRRLLWHGSRSTNFGGILSQ 
GLRIAPPEAPVSGYMFGKGVYFADMSTKSAGYCF 
SWGSGNRGLLLLCDVEVGNPMYER 
DTASFNAGQEAKAEAKIATLGRGRSIPGGWKDAG 
CVNEDLKGVLMPDVRMPTTNSNSR 
GLMYNEYIVYDVAQIQQKYLFHVDMR" 
exon 43783..43852 /locus-tag="An18g01170"
/number=1
intron 43853..43911 /locus-tag="An18g01170"
/number=1
exon 43912..44025 /locus-tag="An18g01170"
/number=2
intron 44026..44074 /locus-tag="An18g01170"
/number=2
exon 44075..44344 /locus-tag="An18g01170"
/number=3
intron 44345..44394 /locus-tag="An18g01170"
/number=3
exon 44395..45240 /locus-tag="An18g01170"
/number=4
intron 45241..45293 /locus-tag="An18g01170"
/number=4
exon 45294..45376 /locus-tag="An18g01170"
/number=5
intron 45377..45424 /locus-tag="An18g01170"
/number=5
exon 45425..45994 /locus-tag="An18g01170"
/number=6
gene <46879..>48537 /locus-tag="An18g01180"
mRNA join(<46879..48454,48518..>48537) /locus-tag="An18g01180"
CDS join(46879..48454,48518..48537) /locus-tag="An18g01180"









/EC-number="3.5.1.11"
/note="unnamed protein product;
Function: penicillin V
amidohydrolase (PVA) from F.
oxysporum is used for the
enzymatic hydrolysis of penicillin
V (phenoxy-methylpenicillin) to
6-aminopenicillanic acid (6-APA).
6-APA is the active beta-lactam
nucleus used in the manufacture of
semi-synthetic penicillins.
Similarity: the predicted A. niger
protein shows strong similarity to
the protein sequence penicillin V
amidohydrolase (PVA) of patent
US5516679-A from F. oxysporum (AC#
AAW00291). Title: strong
similarity to penicillin V
amidohydrolase PVA from patent
US5516679-A - Fusarium oxysporum
extracellular/secretion proteins"
/codon-start=1
/protein-id="CAK47194.1"
/db-xref="GI:134084161"
/translation="MLVYLTNFFIAWLAYFWQPK 
KQPCGSLTLPHIPGVDIISMTRQE 
VHNYSVATSPPYNLADVHNLDFCNVSVTLTHPGE 
DDTVS VS VWLP LNNWNGRFQATGG 
GGLAAGIIGLAPAHPVSQGYATAATDGGLTLNGT 
SNPQTGAWILRPDGSINTALLLNF 
AHRSIYDMTIIGKTLTERFYGSPPRYSYWSGCST 
GGRQGYFAAAKYPNLFDGVLAGAP 
ALNFPRLIGYMFWPPVHMFHSAAPPQCVFDTFWK 
AIIDECDPLDGATDGLISDYNPQS 
CPFKPETLVGHTVTCPEMGSDSPVT I TAQHAT LV 
KQILQGPDLQDHPDLWTGLPPGAS 
FRGTANTQVINGSIVRPVPFFPIIGWIKNFVYRD 
P D YNVF DMTF D DFNT AYRL T L DG Y 
NGI LGSDDLNL SEFRRAGGKLLTWHGLADEL I PA 



SWTNAFWEGIDKTDGADVDEFYRV
FLAPGLGHCSAGHGPKPVDLLGALVRWVEEGIAP
DRLSAAAVKADGKEVTRELCRYPA
T L VYDGHGDVN DAG S F S L VF VGA "
exon 46879..48454 /locus-tag="An18g01180"
/number=1
intron 48455..48517 /locus-tag="An18g01180"
/number=1
48518..48537 /locus-tag="An18g01180"
/number=2
gene complement(48889..48960) /gene="tRNA-Ala (AGC)"
/locus-tag="An18e01190"
tRNA complement(48889..48960) /gene="tRNA-Ala (AGC)"
/locus-tag="An18e01190"
/product="tRNA-Ala"
/inference="profile:tRNAscan:1.4"
/note="codon recognized: GCU"
<49850..>51919 /locus-tag="An18g01200"
join(<49850..49888,49981..50173,50233..50362,50419..50614,50702..50828,50883..51127,51199..51470,51538..>51919) /locus-tag="An18g01200"
join(49850..49888,49981..50173,50233..50362,50419..50614,50702..50828,50883..51127,51199..51470,51538..51919) /locus-tag="An18g01200"



/EC-number="1.14.-.-"
/inference="profile:COGS:COG2124"
/inference="profile:PFAM:PF00067"
/inference="similar to AA sequence:UniProtKB:AFU81806.1"
/note="unnamed protein product;
Function: ord1 of A. flavus
converts O-methylsterigmatocystin
to aflatoxin B1. Pathway: ord1 of
A. flavus catalyzes the last step
of the aflatoxin biosynthetic
pathway. Remark: aflatoxins
comprise a group of
polyketide-derived carcinogenic
mycotoxins. Similarity: the
predicted A. niger protein shows
strong similarity to
O-methylsterigmatocystin
(OMST)-oxidoreductase (ord1) from
A. flavus, which belongs to the
CYP64 family of cytochrome
P450-type monooxygenases. Title:
strong similarity to
O-methylsterigmatocystin
oxidoreductase ord1 - Aspergillus
flavus"



/citation=[36]
/codon-start=1
/protein-id="CAK47195.1"
/db-xref="GI:134084162"
/translation="MQHRHNPSSSCPTSQCGRGA
FIAFTSVYIFFAVLTRKSKRPPLP
PGPRRKPIVGNLWDLPDPSQQDWQHWLKHKDRYG
PISSLSIMGQTIIVLNDARLAVEL
LESRSSIHSSRPQQHFAEMAGWNNVLGAVKQSQR
FRATRKNLHREIGSNVSVARFNEI
QT AE VGRF L LRVL DAP DKLMKH I RKWVG YG YT I E
PHDQDPLVDLADKAMMDFSMAMLP
ATWAVDFFLPLKYLPSWFPGTEFMKIAQRYRKNV
TAF S D I P YAFVKEQMRTGRFVP SF
LSNLLESSDLEPGSEEENTVKWSAGSLYAGGADT
TVSSIASFFLAMALFPEVQRKAQQ
ELDTVIGTDRLPQYADREQLPYINALVKETFRWH
PWPMSLTHTSTADDVCEGYFIPK
GSSVLANIWAFTHDPAAYHDPMTFKPERFLSPKP
ERDPHFLVFGFGRRVCPGRTLADV
NVYLTVAQALAVFE I SKPVKNGKVKDVQPEFLPG
VISHPAPFDVSIRPRSAKHLELLR
SLEQKYPWEKSNAEDLKNI "
exon 49850..49888 /locus-tag="An18g01200"
/number=1
intron 49889..49980 /locus-tag="An18g01200"
/number=1
exon 49981..50173 /locus-tag="An18g01200"
/number=2
intron 50174..50232 /locus-tag="An18g01200"
/number=2
exon 50233..50362 /locus-tag="An18g01200"
/number=3
intron 50363..50418 /locus-tag="An18g01200"
/number=3
exon 50419..50614 /locus-tag="An18g01200"
/number=4
intron 50615..50701 /locus-tag="An18g01200"
/number=4
exon 50702..50828 /locus-tag="An18g01200"
/number=5
intron 50829..50882 /locus-tag="An18g01200"
/number=5
exon 50883..51127 /locus-tag="An18g01200"
/number=6
intron 51128..51198 /locus-tag="An18g01200"
/number=6
exon 51199..51470 /locus-tag="An18g01200"
/number=7
intron 51471..51537 /locus-tag="An18g01200"
/number=7
exon 51538..51919 /locus-tag="An18g01200"
/number=8
gene complement(<52148..>53720) /locus-tag="An18g01210"
mRNA complement(join(<52148..52359,52411..52573,52635..52939,52984..53309,53368..>53720)) /locus-tag="An18g01210"
complement(join(52148..52359,52411..52573,52635..52939,52984..53309,53368..53720))
/locus-tag="Anl8

/inference="profile:COGS:COG0515"
/inference="profile:PFAM:PF00069"
/note="unnamed protein product;
Function: SRPK2 from M. musculus
phosphorylates SF2/ASF, a member
of SR splicing factors. Remark:
overexpression of murine SRPK2
causes disassembly of
cotransfected SF2/ASF and
endogenous SC35. SRPK family
members may regulate the
disassembly of the SR proteins in
a tissue-specific manner.
Similarity: the predicted A. niger
protein shows similarity to SRPK2
from M. musculus and strong
similarity to putative
serine/threonine protein kinases
from several eucaryotic organisms.
Title: similarity to
SR-protein-specific kinase SRPK2 -
Mus musculus nucleus"
/citation=[39]
/codon-start=1
/protein-id="CAK47196.1"
/db-xref="GI:134084163"
/translation="MAGRITQCARRLGLLHAKLH 
LSCRHSMPTPTPHLRNITRASYQT 
IRYSSQWKSSLTVFPTTDALIDPSIEIEEETLPT 
YHPEKYYPVQQGEVLDNRYQVLAK 
LGYGVTSTVWLGRDLRDSKYWLKIYVTGQEKNH 
ELEIYNRMNAVEVEHPGRDLVRRL 
FDHFTVTGPHGPHVCLVHEPMGMSADTLLQKYIP 
GNTMTLDEMKTCIRQLLIALDFLH 
SAARIVHTGKDLQLKNLLLPVPNTKTLETLEERE 
VNDPSPRKILKDRTIYLSTVYNPG 
GSGLPLISDFGEARFGDVEKRDDIMPNMYRAPEV 
VLKENWNYKVDIWNVAMVAWDIVI 
PRHMFDGRNADGIFDDRVHIAEMIALMGPPPASF 
RERCRLAYVFWDEQGNWKDLAPIP 
DISLESLGADIPGENREGFFRWLRKALQWNAEDR 
PTATEFLFDEWLMEGLRRPNNEDA KT" 
exon complement(52148..52359) /locus-tag="Anl8g01210"
/number=1
intron complement(52360..52410) /locus-tag="Anl8g01210"
/number=1
exon complement(52411..52573) /locus-tag="Anl8g01210"
/number=2
intron complement(52574..52634) /locus-tag="Anl8g01210"
/number=2
exon complement(52635..52939) /locus-tag="Anl8g01210"
/number=3
intron complement(52940..52983) /locus-tag="Anl8g01210"



gene 
mRNA 



complement(52984..53309)

complement(53310..53367)

complement(53368..53720)

complement(<54105..>55762)

complement(join(<54105..54731,54787..55334,
55392..55500,
55559..>55762))
complement(join(54105..54731,54787..55334,
55392..55500,
55559..55762))



/number=3 

/locus-tag="Anl8g01210" 



/number=4

/locus-tag="Anl8g01210" 



/number=4 

/locus-tag="Anl8g01210" 



/number=5 

/locus-tag="Anl8g01220" 



/locus-tag="Anl8g01220" 



/locus-tag="Anl8



/inference="profile:COGS:COG0477"
/inference="similar to AA
sequence:PIR:S64826"
/note="unnamed protein product; 
Function: in S. cerevisiae, 
mutants (dal5) that lack 
allantoate transport have been 
isolated, these strains also 
exhibit a 60% loss of allantoin 
transport capability. Regulation: 
in S. cerevisiae Dal5 appears to 
be sensitive to nitrogen 
catabolite repression, feedback 
inhibition, and trans-inhibition. 
Regulation: in S. cerevisiae 
allantoate uptake is constitutive. 
Similarity: the predicted A. niger 
protein shows strong similarity to 
allantoate permease gene (DAL5) 
from S. cerevisiae, which belongs 
to the major facilitator 
superfamily. Title: strong 
similarity to allantoate permease 
Dal5 - Saccharomyces cerevisiae" 
/citation=[7] 
/citation=[9]
/codon-start=1
/protein-id="CAK47197.1"
/db-xref="GI:134084164"
/translation="MEEKLPGAFSQALPVDRTDD 
AVEYLQGHAAADSPQIDLRALRRK 
IDWHLIPFMFSCYVLQFLDKVMLNYAAVMGMKTE 
LHLAGNEYTNTATWFFLAYLIAEA 
PNIYCLQKVPAAKWLGVNVALWGVAAAASAGAKN 
YATLLTARVFLGIFEATIGPSLML 
ISSQYYTKSEQAPRFTLWYNGLGVAQIIGGLVSF 
GFQHVHHGATLAGWRIMFLVIGLV
TVLIGVLTLLFIPDTPMKAKWLSEEEKVALLQHV 
SVNQTGVWSSAINLKQIWEGVLDI 
QLWLLVLITILISVSSGWTTYSATLIAGFGYSG 
PISALLNMPSGIVSIFFTLLVGFG 
IRRTSHRWAWNAFCTIPGIIGGGLLSFLPKSNKA 



GVLIGIYLVNAIVATLPILYQWTM 
ANCAGHTKRAFSSALIAGSFSVGNIIGPQTFQAR 
DAPEYRPAKIAVLATQAAAAVMSV 
VLFAYYVWENRRRDRRYGKVEDSMVDEAKWAGLT
DKENTRFRYVY" 
complement(54105..54731) /locus-tag="Anl8g01220"
/number=1
complement(54732..54786) /locus-tag="Anl8g01220"
/number=1
complement(54787..55334) /locus-tag="Anl8g01220"
/number=2
complement(55335..55391) /locus-tag="Anl8g01220"
/number=2
complement(55392..55500) /locus-tag="Anl8g01220"
/number=3
complement(55501..55558) /locus-tag="Anl8g01220"
/number=3
complement(55559..55762) /locus-tag="Anl8g01220"
/number=4

/locus-tag="Anl8g01230"
/locus-tag="Anl8g01230"

<56384..>58444

join(<56384..56645,56694..56761,
56818..57009,57069..57368,
57430..57711,57897..58158,
58326..>58444)

join(56384..56645,56694..56761,
56818..57009,57069..57368,
57430..57711,57897..58158,
58326..58444)



/locus-tag="Anl8g01230" 



/note="unnamed protein product; 
Function: M. grisea Pth11 is a
pathogenicity gene. Function: M.
grisea Pth11p likely is involved
in host surface recognition.
Function: M. grisea pth11 mutants
of strain 4091-5-8 are
nonpathogenic due to a defect in
appressorium differentiation.
Localization: in M. grisea, a
Pth11-green fluorescent protein
fusion localised to the cell
membrane and vacuoles. Similarity:
similarity of the predicted A.
niger protein and M. grisea Pth11
is limited to the N-terminal half
of the protein sequences. Title:
similarity to integral membrane
protein PTH11 - Magnaporthe grisea
plasma membrane"
/citation=[52]



/codon-start=1

/protein-id="CAK47198.1"

/db-xref="GI:134084165"

/translation="MGWVHHASPEVEAESQYRLI

LGVCLGLTLLMIITVCLRLAIRFN 

ARRLEASDFVMLVTMVGYLRWIDDPTANRCQIFS 

IIYSALCVAQSRYGLGLPLSLRPK 

ADLPDYTKLNYAGRPFYQLGIAGFKASLCLNYLR 

LISGTSKNFYRILIWAVIAVSTLG 

HLAGTLVLIFDCQPVERAWNPNISGSCLPAGPTF 

YGLAIFTIICDITIIILPIPLLLQ 

LNIKTAQKAGWCLFLLGLFTTICSILRLTQIHR 

VAYGDGNSTMLVLWGTIEFNVGNI

VTCVPFLAPLLKGAVRDFRSYSGRKGYDSRSYAL 

QTWSKDPRSQLRSTTSAAPQPKRT 

PSEELILESGGVDEGGIHMTVELRVSLEKRPTTE 

HDGINSAYASQDQIRPGSYWVDVT 

DTSQWSRAEQARPTVPDGIFPAIKPSSIQEESCA 

SRETLMWHPDGQPRAATELCPAH 

ASWPVGLETGTKASSTTPSPTRFQPNILSAVKAL 

LGPALFHLTH" 



exon 56384..56645 /locus-tag="Anl8g01230"
/number=1
intron 56646..56693 /locus-tag="Anl8g01230"
/number=1
exon 56694..56761 /locus-tag="Anl8g01230"
/number=2
intron 56762..56817 /locus-tag="Anl8g01230"
/number=2
exon 56818..57009 /locus-tag="Anl8g01230"
/number=3
intron 57010..57068 /locus-tag="Anl8g01230"
/number=3
exon 57069..57368 /locus-tag="Anl8g01230"
/number=4
intron 57369..57429 /locus-tag="Anl8g01230"
/number=4
exon 57430..57711 /locus-tag="Anl8g01230"
/number=5
intron 57712..57896 /locus-tag="Anl8g01230"
/number=5
exon 57897..58158 /locus-tag="Anl8g01230"
/number=6
intron 58159..58325 /locus-tag="Anl8g01230"
/number=6
exon 58326..58444 /locus-tag="Anl8g01230"
/number=7


gene <59070..>60560 /locus-tag="Anl8g01240"
mRNA join(<59070..59441, /locus-tag="Anl8g01240"
59521..59806,
59879..60178,
60237..60312,
60374..>60560)
CDS join(59070..59441, /locus-tag="Anl8g01240"
59521..59806,
59879..60178,
60237..60312,
60374..60560)









/inference="profile:COGS:COG0654" 
/inference="profile:PFAM:PF01360"
/note="unnamed protein product;
Catalytic activity: NahW of P.
stutzeri catalyzes the conversion
salicylate + NADH + O(2) <=>
catechol + NAD(+) + H(2)O + CO(2).
Pathway: NahW of P. stutzeri is
involved in the metabolization of
naphtalene and salicylates (lower
naphtalene degradation pathway).
Similarity: the predicted A. niger
protein shows similarity to
salicylate hydroxylase (nahW) of
P. stutzeri, which belongs to the
NADH-dependent monooxygenase
superfamily. Title: similarity to
salicylate hydroxylase nahW
- Pseudomonas stutzeri"
/citation= 

< User Break > 

sequence:PIR:S45605"

/note="unnamed protein product; 
Catalytic activity: alcohol + NAD+ 
= aldehyde or ketone + NADH. 
Pathway: alcohol dehydrogenase 
ADH-T from B. stearothermophilus 
is involved in glycolysis / 
gluconeogenesis; fatty acid 
metabolism; bile acid 
biosynthesis; tyrosine metabolism; 
glycerolipid metabolism. Remark: 
the protein sequence of alcohol 
dehydrogenase ADH-T from B. 
stearothermophilus NCA1503 is 
covered by patent JP04218378-A 
(AC# AAR26874). Similarity: the
predicted A. niger protein shows 
strong similarity to thermostable 
alcohol dehydrogenase ADH-T from 
B. stearothermophilus NCA1503, 
which belongs to the zinc alcohol 
dehydrogenase (ADH) family. Title: 
strong similarity to thermostable 
alcohol dehydrogenase adhT - 
Bacillus stearothermophilus" 
/citation=[15]
/citation=[58]
/codon-start=1
/protein-id="CAK47216.1"
/db-xref="GI:134084183"
/translation="MEYTFKVFCGSSDGKWEKL 
TTRRLGDNDVFIETTHSGLCGTDE 
HFLHCDQALGHEGVGWKHVGPSVSSVKVGDRVG 
FGFIRRVCGRCDNCISGCDHHCRE 
KRAYGQHDFDVGSFSHGTVWDADAVYPIPEGYDS 
AHAAPLLCAGASVWACLTNNGIRP
SDRVGVMGIGGLGHLAIKLARALRYNWALSSSE
KKREEALEFGASEFYRFPNTQTPN 
HIKPVKHLLLCGSSDVDYASWVSHLPSKQNAIDH 
ANIYRSLLDLVDTNGTIYHISVTL 
KPTPIPLVPFGQKGIRIQGCFITSRRNLQELLEF 
AARFDIKPTIMTFPLTRNGLEETI 
EKLRAGRIRYRAVLEYQAP"
exon complement(100363..101141) /locus-tag="Anl8g01430"

complement(101142..101213)

exon

intron

exon

complement(101214..101^96)

<101875..>104542
join(<101875..101958,102203..102219,
102259..102360,102392..102399,
102621..103153,103205..>104542)
join(101875..101958,102203..102219,
102259..102360,102392..102399,
102621..103153,103205..104542)

101875..101958
101959..102202
102203..102219



/number=l 

/locus-tag="Anl8g01430" 
/number=l 

/locus-tag="Anl8g01430"
/number=2 

/locus-tag="Anl8g01440" 
/locus-tag="Anl8g01440" 



/locus-tag="Anl8 



/inference="profile:COGS:COG3496"

/note="unnamed protein product; 

Similarity: the predicted A. niger 

protein shows local similarity to 

the hypothetical protein mll8086 

from M. loti. Title: strong 

similarity to hypothetical protein 

CAD70872.1 - Neurospora crassa" 

/codon-start=1

/protein-id="CAK47217.1"

/db-xref="GI:134084184"

/translation="MTAKCMVIKKAEEDHTEFLG 

LHHVGSDKNFDNCVTKEVRGPQIL 

KKFLLRQWIPAQGGREQQPESLILSPVRKNLPSI 

SVDGILLHAALLTVNLILKLGGWP 

GLLKFMLRTFAISSVLAPIGLIGFLFLVSFTPRN 

IKSLSKDKSVIGKPLLFPITLDHT 

RLSPIKNNFTFNVLFVGIPVGISCRFGRLLSIDA 

KHTDEEECTERSSLLRLLQTYFSS 

WFSFDSARYLHRGDDTLSLENKLNKFLREQNENP 

AKWPYAYMLSVPRFLWWERSWTW 

WYLYSESKELDAVIMEINNSFDEKRNVLFKVRRT 

RIYTESPEKGFEQLLDCKEEHLDE 

DKRVFSLIPQHGKYAYKATWKKEIFSSPFEKVGE 

TVSSTFLDPWPSSWSGNRSLSNT 

TTFDPSGAPRMIARLWCKVPPIDPGKASSFQIFS 

ILLIWTVNVTLATPRILFQAIRLH 

VMNLMRMMEHPDVRPGSEPRRPSKGERKDAPLKM 

LHSLFDLHANLQSRMLERFFREYL 

KHIVASYPGDLEVTYIPCKSVFKTTICLRSTQYS 

AEGQDPRRLRLEVMDPAFYSRIVN 

SPNAGTAMAKETKPGRSPADALSSPVIASDITQS 

LQLLDVTVSENRTYDKACSRISSW 

VLCLRRWLTSSFMDSFVCKALPPRTQEEYVSCLI 

QLWLEKLTWDFLSPHQIYRVIRAA 

MLQWAICWILF" 

/locus-tag="Anl8g01440" 

/number=l 

/locus-tag="Anl8g01440" 
/number=l 

/locus-tag="Anl8g01440" 
/number=2 



intron 

exon 

intron 

exon 

intron 

exon 

intron 



102220..102258

102259..102360

102361..102391

102392..102399

102400..102620

102621..103153

103154..103204

103205..104542

<106231..>106803
join(<106231..106565,
106647..>106803)
join(106231..106565,
106647..106803)



/locus-tag="Anl8g01440"
/number=2
/locus-tag="Anl8g01440"
/number=3
/locus-tag="Anl8g01440"
/number=3
/locus-tag="Anl8g01440"
/number=4
/locus-tag="Anl8g01440"
/number=4
/locus-tag="Anl8g01440"
/number=5
/locus-tag="Anl8g01440"
/number=5
/locus-tag="Anl8g01440"
/number=6
/locus-tag="Anl8g01450"
/locus-tag="Anl8g01450"



106231..106565



/locus-tag="Anl8g01450" 

/EC-number="1.14.-.-"
/note="unnamed protein product;
Function: ordA of A. parasiticus
is involved in the aflatoxin
biosynthesis and converts
O-methylsterigmatocystin (OMST) to
aflatoxins B1 or G1 and converts
dihydro-O-methylsterigmatocystin
(DHOMST) to aflatoxins B2 or G2.
Remark: aflatoxins comprise a
group of polyketide-derived
carcinogenic mycotoxins. Remark:
it is assumed that the ORF is
N-terminally shorter and has
another start codon 5' to the
predicted one; the ORF is around
400 amino acids shorter than most
of the homologues cytochrome p450
proteins. Remark: ordA of A.
parasiticus is also called
cytochrome p450 64, cyp64 or omst
oxidoreductase. Similarity: the
ORF shows similarity to several
cytochrome P450 related proteins
from different species. Title:
strong similarity to
O-methylsterigmatocystin
oxidoreductase ordA - Aspergillus
parasiticus"
/citation=[43]
/codon-start=1
/protein-id="CAK47218.1"
/db-xref="GI:134084185"
/translation="MTFNPERFLTTESYQAEHDP 
HNLAFGFGRRICPGRGFADSTIFL 
TWRSLQAFRIAKISEDGREIEPIVEYLPGVISH 
PKPFAISITPRSKEHESFIRSIEI 
EHPWEKGDLVSILSVNVLVWASKLRVFTILTSL
YRLPLIGNCKPPNLLLGRMGLIRFSLH"
/locus-tag="Anl8g01450" 
/number=l 



intron 
exon 



106556..106646

106647..106803

<107279..>108340
join(<107279..107327,
107380..107740,
107795..107907,
107966..107988,
108045..108097,
108151..>108340)
join(107279..107327,
107380..107740,
107795..107907,
107966..107988,
108045..108097,
108151..108340)



join(107279..107327,
107380..107387)



join(107388..107740,



/locus-tag="Anl8g01450" 
/number=l 

/locus-tag="Anl8g01450" 
/number=2 

/locus-tag="Anl8g01460" 
/locus-tag="Anl8g01460" 



/locus-tag="Anl8 



/EC-number="l 

/inference="profile:COGS:COG1028"
/inference="profile:PFAM:PF00106"
/note="unnamed protein product;
Catalytic activity: cyclohexanol
dehydrogenases convert
cyclohexanol + NAD(+) <=>
cyclohexanone + NADH. Function:
chnA of A. sp. is an alcohol
dehydrogenases proposed to
catalyze the conversion of
cyclohexanol to cyclohexanone (EC
1.1.1.245). Phenotype:
cyclohexanol was detected as the
major intermediate accumulated in
the chnA mutant of A. sp. Remark:
chnA of A. sp. is encoded in the
gene cluster for cyclohexanol
oxidation. Similarity: the ORF
shows similarity to several
dehydrogenases from different
species and with various
specificities. Title: strong
similarity to cyclohexanol
dehydrogenase chnA - Acinetobacter
sp"

/citation=[57]

/codon-start=1

/protein-id="CAK47219.1"

/db-xref="GI:134084186"

/translation="MAHLLQGTAFITGAGSGIGA 

GVARTFIQNGISKLALVDINLSIL 

QKVSESLKQTFPNVELLTSAVDVTNEVHVDNAVQ

KAATTFGRVDIGVNSAGVGDPGEP

TDRLSLKDWQRTVHINQTGVWLSQRALIQQMLTQ

KSRGTRHGRGVIVNVCSSLGITAS

ATGIPFPAYTSSKHGVMGLTKMDAKYYAPSGIRI

NAVCPGFVDTPMVSKAVDSGVFKK

EVESAPVGRIGDVEEITDSILFLASPMSSFIYGA

GLWDGGYGL" 

/locus-tag="Anl8g01460" 

/inference="protein
motif:SignalP:2.0"
/locus-tag="Anl8g01460" 





107795..107907,
107966..107988,
108045..108097,
108151..108337)














/product="unnamed"




exon 107279..107327 /locus-tag="Anl8g01460"
/number=1
intron 107328..107379 /locus-tag="Anl8g01460"
/number=1
exon 107380..107740 /locus-tag="Anl8g01460"
/number=2
intron 107741..107794 /locus-tag="Anl8g01460"
/number=2
exon 107795..107907 /locus-tag="Anl8g01460"
/number=3
intron 107908..107965 /locus-tag="Anl8g01460"
/number=3
exon 107966..107988 /locus-tag="Anl8g01460"
/number=4
intron 107989..108044 /locus-tag="Anl8g01460"
/number=4
exon 108045..108097 /locus-tag="Anl8g01460"
/number=5
intron 108098..108150 /locus-tag="Anl8g01460"
/number=5
exon 108151..108340 /locus-tag="Anl8g01460"
/number=5






gene <108826..>111209 /locus-tag="Anl8g01470"
mRNA join(<108826..108944, /locus-tag="Anl8g01470"
109020..109023,
109089..109305,
109377..109778,
109841..109905,
110160..110181,
110244..110293,
110355..110492,
110524..110752,
110866..110960,
111135..>111209)
CDS join(108826..108944, /locus-tag="Anl8g01470"
109020..109023,
109089..109305,
109377..109778,
109841..109905,
110160..110181,
110244..110293,
110355..110492,
110524..110752,
110866..110960,
111135..111209)









/EC-number="1.14.-.-"
/inference="profile:COGS:COG0493"
/note="unnamed protein product; 
Remark: aflatoxins are 
polyketide-derived secondary 
metabolites. Remark: moxY of A. 
parasiticus is expressed 
concurrently with genes involved 
in aflatoxin biosynthesis and it 
lies on one end of the cluster of 
this genes. Therefore moxY of A. 
parasiticus is presumably also 



exon 108826..108944
intron 108945..109019
exon 109020..109023
intron 109024..109088
exon 109089..109305
intron 109306..109376
exon 109377..109778
intron 109779..109840
exon 109841..109905
intron 109906..110159
exon 110160..110181
intron 110182..110243
exon 110244..110293
intron 110294..110354
exon 110355..110492
intron 110493..110523
exon 110524..110752



involved in aflatoxin 

biosynthesis. Similarity: the ORF 

shows similarity to monooxigenases 

from several species and with 

different functions. Title: strong 

similarity to monooxygenase moxY 

- Aspergillus parasiticus"

/citation=[55]

/codon-start=1

/protein-id="CAK47220.1"

/db-xref="GI:134084187"

/translation="MANEQTCSPSINLNTHENMC 

KGRIQQRRYPATCNMLKKPRCVPH 

TQIELCTVLFAFSNMISASTTDTFTQVIIIGAGM

SGLAMACQLKKQLCCEDFVIYDRA 

PSFGGTWYFNKCCGVDIPAAFYSFSFALYPQFTC 

FFPKQEEILQYIHGVADEFSVALK

LVGHTEWEGADWQDSEQCWEVRLREIPSGRKFTR 

RCRILISAVGGLTNPKHVMLQGIE 

RFQGNIVHTALWDQETAVAGKNVIVIGNGASATQ 

FIPAIADDAASINQFIRHVRKCAP

EEQYWSLLTPEYSIGCKRRVFDNDGYLKCLHRPN 

VDITNDPWAVEEQSITTQSGKRF 

PADLIADVDLDYGRCLRLGSLSRSTMSTGEWMG 

VRDKNIGTVSAAYKHLRRWPWRNF

LTFSTFSDPTLAGAIHQLSIQLKVSWIKQASEE 

RFNTKLKSALRKTVFTNMCRSLPA 

STISLVRVLTVVEEKPSLSLL" 

/locus-tag="Anl8g01470" 

/number=l 

/locus-tag="Anl8g01470" 
/number=l 

/locus-tag="Anl8g01470" 
/number=2 

/locus-tag="Anl8g01470" 
/number=2 

/locus-tag="Anl8g01470" 
/number=3 

/locus-tag="Anl8g01470" 
/number=3 

/locus-tag="Anl8g01470" 
/number=4 

/locus-tag="Anl8g01470" 
/number=4 

/locus-tag="Anl8g01470" 
/number=5 

/locus-tag="Anl8g01470" 
/number=5

/locus-tag="Anl8g01470" 
/number=6 

/locus-tag="Anl8g01470" 

/number=6 

/locus-tag="Anl8g01470" 
/number=7 

/locus-tag="Anl8g01470" 
/number=7 

/locus-tag="Anl8g01470" 
/number=8 

/locus-tag="Anl8g01470" 
/number=8 

/locus-tag="Anl8g01470" 



gene 
mRNA 
CDS 



intron 



exon 



exon 



intron



<111907..>113460
<111907..>113460
111907..113460



111135..111209



110961..111134



110866..110960



110753..110865



/locus-tag="Anl8g01470"
/number=9
/locus-tag="Anl8g01470"
/number=10
/locus-tag="Anl8g01470"
/number=10
/locus-tag="Anl8g01470"
/number=11
/locus-tag="Anl8g01480"
/locus-tag="Anl8g01480"
/locus-tag="Anl8g01480"



/inference="profile:COGS:COG2124"
/inference="profile:PFAM:PF00067"
/inference="similar to AA
sequence:UniProtKB:AB013443.1"
/note="unnamed protein product; 
Function: eln2 of C. cinereus 
encodes a novel type of microsomal 
cytochrome P450 enzyme, with is 
involved in mushroom 
morphogenesis. Phenotype: a 
dominant mutation of the 
elongationless2 (eln2) gene of the 
mushroom C. cinereus affects 
pattern formation in the 
development of fruit body 
primordia, causing dumpy primordia 
which culminate in mature fruit 
bodies with short stipes. 
Similarity: the ORF shows 
similarity to several cytochrome 
p450 related proteins from 
different species, which have 
different cellular functions. 
Title: strong similarity to 
cytochrome p450 related protein 
eln2 - Coprinus cinereus" 
/citation=[54]
/codon-start=1
/protein-id="CAK47221.1"
/db-xref="GI:134084188"
/translation="MPTILLTLLALLTLRLLYEY 
KRDRNLPPGPRRLPLIGNLHQAPQ 
TLPWRVFDQWSKTYGPIMSAQFGRQTLILITSPT 
IARDLLDKRGSIYADRPDLVMANN
ITKGLHMLIRQYDDWLRLHQRLDAPLLSPRASNT 
YHPIQDLESKQLMFDLLRSNDFDA 
HFERYSGSLMFALAYGFRLLSPKGQELRDMRTIQ 
GNFTYAARVGTWIVDAIPVLNYLP 
AWAPWKRLAEKLFKLEASVHTRHLEKGLNSEPW 
NWSKEFAASKHAEGMPRLDLAYNL 
GILVDAGFETTWTVMKIFVLAMRSDPRFVAVARK 
ELDEWGEDRMPTFEDQEKLVYIQ 
AWDETLRWRSMAPGGIPHAARKEDTYMGYRIPK 
GATVIPLFWSMCLTDEPWDDPLEF 
RPERWFEATEKEEGRFRNFFGYGRRICTGRHIAR 
NSLFLLMARILWAFDIQAPLGDDG 
KPVPVDDMAFDSAFVSTPEPFEALFVPRSEKTKE 
IVEREWNEMEKDMAVLMGQVRDSQ 
RALGLDVRA" 



sig-peptide 



111907. .111957 



/locus-tag="Anl8g01480" 
/inference="protein



mat-peptide

gene 
mRNA 

CDS 



111958..113457
111907..113460

complement(<113565..>114733)

complement(join(<113565..114153,
114210..>114733))

complement(join(113565..114153,
114210..114733))



motif:SignalP:2.0"

/locus-tag="Anl8g01480" 

/product="unnamed" 

/locus-tag="Anl8g01480" 

/number=l 

/locus-tag="Anl8g01490"
/locus-tag="Anl8g01490"



/locus-tag="Anl8g01490"



/inference="profile:COGS:COG0598"
/inference="profile:PFAM:PF01544"
/inference="similar to AA
sequence:UniProtKB:SC41293.1"
/note="unnamed protein product; 
Function: alr2 of S. cerevisiae is 
an uptake transporter for 
inorganic cations, e. g. 
magnesium, which presence seems to 
be important for resistance to the 
toxic effect of aluminum. Remark: 
alr2 of S. cerevisiae is also 
called YFL050C. Similarity: the 
predicted ORF is 395 amino acids 
shorter at the N-terminus and 57 
amino acids shorter at its 
C-terminal end than alr2 of S. 
cerevisiae (nearly the same is 
true for alr1 of S. cerevisiae).
Title: strong similarity to ion
transporter Alr2 - Saccharomyces
cerevisiae plasma membrane"
/citation=[40]
/codon-start=1
/protein-id="CAK47222.1"
/db-xref="GI:134084189"
/translation="MSSDSTPDPRLEKPRVAVAS 
RLNFFTSQLNTSIEASSIEDLCSV 
YRPFELLLETGAHSGLWWLDITAPSEEDIEALAR 
FFNLHPLTTEDIKTRETREKIELF 
GQYYFLSLRPPRRLETDTGVRIVSHNLYAWFRG 
GVLSFSFDPSLHTSHVRQRIKEHS 
SHLLLTSDWICYALIDDIVDGFAPFISRVENGW 
TVEDSVSITRPDDMGLALQRIFKL 
RKEVMNIRQPLHDKIDVIRSFARHCDISDTSSSQ 
VALYLSDICDHWTMIANLEQAEQ 
MLSRLQSKYLTQVHFDSGRMRNGIASALSKLTVL
ASILVPMQFITGLFGMNVRVPGKT
HDGDNSLTWWFSILGFILGLTVIFAWVAKRIGLL 
DR" 



complement(113565..114153)

complement(114154..114209)

complement(114210..114733)



/locus-tag="Anl8g01490" 



/number=l 

/locus-tag="Anl8g01490" 



/number=l 

/locus-tag="Anl8g01490" 



gene complement(<115549..>116444) /locus-tag="Anl8g01500"

=> d hist 

(FILE 'HOME' ENTERED AT 13:55:21 ON 07 JUL 2008) 

INDEX 'ADISCTI, ADISINSIGHT, ADISNEWS, AGRICOLA, ANABSTR, ANTE, AQUALINE, 
AQUASCI, BIOENG, BIOSIS, BIOTECHABS, BIOTECHDS, BIOTECHNO, CABA, CAPLUS, 
CEABA-VTB, CIN, CONFSCI, CROPB, CROPU, DDFB, DDFU, DGENE, DISSABS, DRUGS, 
DRUGMONOG2, DRUGU, EMBAL, EMBASE, ENTERED AT 13:55:47 ON 07 JUL 2008 

SEA LACTOBACILLUS REUTERI AND TOXIN (P) BIND? AND CD4+ CELL? AND 



0* FILE ADISCTI 
SEA LACTOBACILLUS REUTERI AND TOXIN (P) BIND? AND CD4+ CELLS 



0* FILE ADISCTI 
SEA LACTOBACILLUS REUTERI AND TOXIN (P) BIND? 



0* FILE ADISNEWS
0* FILE ANTE
0* FILE AQUALINE
0* FILE BIOENG
1* FILE BIOTECHABS
1* FILE BIOTECHDS
0* FILE BIOTECHNO
2 FILE CAPLUS
0* FILE CEABA-VTB
0* FILE CIN
0* FILE ESBIOBASE
0* FILE FOMAD
0* FILE FOREGE
2* FILE FROSTI
0* FILE FSTA
5 FILE GENBANK
1 FILE IFIPAT
0* FILE KOSMET
0* FILE NTIS
0* FILE NUTRACEUT
1* FILE PASCAL
0* FILE PHARMAML
1 FILE SCISEARCH
1 FILE TOXCENTER
21 FILE USPATFULL
3 FILE USPAT2
0* FILE WATER
2 FILE WPIDS
2 FILE WPINDEX



L1 QUE LACTOBACILLUS REUTERI AND TOXIN (P) BIND? 



FILE 'BIOTECHDS, CAPLUS, FROSTI, GENBANK, IFIPAT, PASCAL, SCISEARCH,
TOXCENTER, USPATFULL, USPAT2' ENTERED AT 13:59:37 ON 07 JUL 2008
L2 38 S L1
L3 34 DUP REM L2 (4 DUPLICATES REMOVED)
L4 27 S L3 AND (TABLET OR FOOD OR DIETARY SUPLEMENT OR CONFECTIONERY
L5 27 DUP REM L4 (0 DUPLICATES REMOVED)

=> d l5 1

L5 ANSWER 1 OF 27 USPATFULL on STN 
AN 2007:296111 USPATFULL 

TI Lactobacillus acidophillus nucleic acid sequences encoding cell surface
protein homologues and uses therefore
IN Klaenhammer, Todd R., Raleigh, NC, UNITED STATES
Altermann, Eric, Palmerston North, NEW ZEALAND
Buck, B. Logan, Banner Elk, NC, UNITED STATES
Russell, W. Michael, Newburgh, IN, UNITED STATES
PA North Carolina State University, Raleigh, NC, UNITED STATES
(U.S. corporation)
PI US 20070258955 A1 20071108
AI US 2007-701335 A1 20070201 (11)
RLI Division of Ser. No. US 2004-831070, filed on 23 Apr 2004, PENDING
PRAI US 2003-465621P 20030425 (60)
DT Utility
FS APPLICATION
LN.CNT 5104
INCL INCLM:
INCLS:

NCL 



NCLM: 
NCLS: 



424/093.400 
435/252.100; 
435/070.100; 
424/093.400 
435/006.000; 
435/320.100; 



435/252.900; 435/320.100; 435/006.000; 
514/002.000; 530/300.000; 530/387.100; 



435/007. 100 
514/002.000 

IC IPCI A61K0035-00 [I, A]; A61K0038-00 

C07H0021-02 [I, A]; C07H0021-00 
C07K0004-00 [I, A]; C12N0001-20 
C12P0021-04 [I, A] 
IPCR A61K0035-00 [I,C] 

A61K0038-00 [I, A]; A61P0001-00 
C07H0021-00 [I,C]; C07H0021-02 
C07K0004-00 [I,C]; C07K0004-00 
C07K0014-335 [I, A]; C07K0016-00 
C12N0001-20 [I,C] 
C12N0001-21 [I, A] 
C12N0015-00 [I,C]; C12N0015-00 
C12P0021-04 [I, A]; C12P0021-06 
C12Q0001-68 [I,C]; C12Q0001-68 

CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



C12Q0001-68 
A61K0035-00 



C12N0001-20 
C12N0009-00 



435/070.100; 435/252.100; 
530/300.000; 530/387.100; 

I, A] ; A61P0001-00 [ 
I,C*]; C07K0016-00 [ 
I, A]; C12N0015-00 [I 



I, A] 

I, A] ; A61K0038-00 
I, C] ; A61P0001-00 [I 
I, A]; C07H0021-04 [I 
I, A]; C07K0014-195 [ 
[I,C]; C07K0016-00 [ 
I, A]; C12N0001-21 [I 
I,C*]; C12N0009-00 [ 
I, A]; C12P0021-04 [I 
I,C*]; C12P0021-06 [ 
I, A] 



435/007.100 
536/023.100 

435/252.900 
536/023.100 
,A] ; 
I, A] ; 
,A] ; 

C] ; 
A] ; 
A] ; 
I,C*] ; 
I, A] ; 

C*] ; 
I, A] ; 

C] ; 
I, A] ; 



L5 ANSWER 2 OF 27 USPATFULL on STN 
AN 2007:197155 USPATFULL 

TI Lactobacillus acidophilus nucleic acid sequences encoding cell surface 

protein homologues and uses therefore 
IN Klaenhammer, Todd R., Raleigh, NC, UNITED STATES 

Altermann, Eric, Palmerston North, NEW ZEALAND 

Buck, B. Logan, Banner Elk, NC, UNITED STATES 

Russell, W. Michael, Newburgh, IN, UNITED STATES 
PA North Carolina State University, Raleigh, NC, UNITED STATES, 27695-821 

(U.S. corporation) 
PI US 20070172495 A1 20070726

AI US 2007-701319 A1 20070201 (11)

RLI Division of Ser. No. US 2004-831070, filed on 23 Apr 2004, PENDING 

PRAI US 2003-465621P 20030425 (60) 

DT Utility 

FS APPLICATION 

LN.CNT 5104 

INCL INCLM: 424/234.100 

INCLS: 514/044.000; 435/006.000; 435/007.320; 435/069.100; 435/252.900 
435/471.000; 530/350.000; 536/023.700 
NCL NCLM: 424/234.100 

NCLS: 435/006.000; 435/007.320; 435/069.100; 435/252.900; 435/471.000 
514/044.000; 530/350.000; 536/023.700 



IC IPCI A61K0048-00 [I, A]; A61K0039-02 [I, A]; C12Q0001-68 [I,A]; 

G01N0033-554 [I, A]; C07H0021-04 [I, A]; C07H0021-00 [I,C*]; 

C12P0021-06 [I, A]; C07K0014-335 [I, A]; C07K0014-195 [I,C*] 
IPCR A61K0048-00 [I,C]; A61K0048-00 [I, A]; A61K0039-02 [I,C]; 

A61K0039-02 [I,A]; C07H0021-00 [I,C]; C07H0021-04 [I,A]; 

C07K0014-195 [I,C]; C07K0014-335 [I, A]; C12N0001-21 [I,C*]; 

C12N0001-21 [I,A]; C12N0009-00 [I,C*]; C12N0009-00 [I,A]; 

C12P0021-06 [I,C]; C12P0021-06 [I, A]; C12Q0001-68 [I,C]; 

C12Q0001-68 [I, A]; G01N0033-554 [I,C]; G01N0033-554 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L5 ANSWER 3 OF 27 USPATFULL on STN

AN 2007:140436 USPATFULL 

TI Delivery of trefoil peptides 

IN Hans, Wolfgang Christian, Landshut, GERMANY, FEDERAL REPUBLIC OF 

Steidler, Lothar, Lokeren, BELGIUM 

Remaut, Erik Rene, Lovendegem, BELGIUM 
PI US 20070122427 A1 20070531

AI US 2007-654879 A1 20070118 (11)

RLI Division of Ser. No. US 2002-30390, filed on 16 Apr 2002, PENDING A 371 

of International Ser. No. WO 2000-EP6343, filed on 5 Jul 2000 
PRAI EP 1999-870143 19990705 

DT Utility 

FS APPLICATION 

LN.CNT 1335 

INCL INCLM: 424/200.100 

INCLS: 435/252.300; 435/252.900 
NCL NCLM: 424/200.100 

NCLS: 435/252.300; 435/252.900 
IC IPCI A61K0039-02 [I, A]; C12N0001-21 [I, A] 

IPCR A61K0039-02 [I,C]; A61K0039-02 [I, A]; C12N0001-21 [I,C]; 
C12N0001-21 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L5 ANSWER 4 OF 27 USPATFULL on STN
AN 2007:134502 USPATFULL
TI Corynebacterium glutamicum genes encoding metabolic pathway proteins
IN Pompejus, Markus, Freinsheim, GERMANY, FEDERAL REPUBLIC OF
Kroger, Burkhard, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF
Schroder, Hartwig, Nussloch, GERMANY, FEDERAL REPUBLIC OF
Zelder, Oskar, Speyer, GERMANY, FEDERAL REPUBLIC OF
Haberhauer, Gregor, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF
PA BASF Aktiengesellschaft, Ludwigshafen, GERMANY, FEDERAL REPUBLIC OF
(non-U.S. corporation)
PI US 20070117183 A1 20070524
AI US 2006-511140 A1 20060828 (11)
RLI Division of Ser. No. US 2005-55822, filed on 11 Feb 2005, PENDING
Continuation of Ser. No. US 2000-606740, filed on 23 Jun 2000, ABANDONED
PRAI DE 1999-19932125 19990709
DE 1999-19932227 19990709
DE 1999-19932228 19990709
DE 1999-19932230 19990709
DE 1999-19933005 19990714

AB Nutritional compositions are provided which comprise oligofructose,
sialyllactose and probiotic bacteria, which are useful in the
eradication of pathogenic microorganisms in the gastrointestinal tracts
of patients.

SUMM [0004] Prebiotics are nondigestible food ingredients that
beneficially affect the host by selectively stimulating the growth
and/or activity of one or a limited number.

SUMM [0011] Sialyllactose is known to have anti-adhesive properties for
specific pathogenic bacteria. For example, sialyllactose acts to inhibit
cholera toxin in vitro (Idota et al., "Inhibition of Cholera
Toxin by Human Milk Fractions and Sialyllactose," Biosci.
Biotech. Biochem. 59:417-419) and Helicobacter pylori (Simon et al.,
"Inhibition of Helicobacter pylori Binding to Gastrointestinal
Epithelial Cells by Sialic Acid-Containing Oligosaccharides," Infection
and Immunity, 750-757, (1997)).

SUMM [0015] Probiotics are live microbial food ingredients that 

have a beneficial effect on human health. (Salminen et al., "Functional 
food science and gastrointestinal physiology and function." 
Brit. J. Nutr. 80(suppl. 1):S147-S171 (1998)). 

SUMM . . . carbohydrate to lactic acid. The specific strains most often 

studied include members of the genera Lactobacillus and Bifidobacterium. 
(Sanders, "Probiotics." Food Technol. 53:67-77 (1999)). 

SUMM [0017] Some lactic acid bacteria specifically produce lactic acid as a 
major product of their metabolism. Some produce predominantly 
the levorotary "L"-form of lactic acid [L(+)-lactic acid], others 
produce predominantly the dextrorotary "D"-form. 

SUMM [0019] A milk product containing LGG significantly shortened 

the duration of diarrhea in young children. (Kaila et al., "Enhancement 
of the circulating antibody secreting. 

SUMM [0021] U.S. Pat. No. 5,908,646 discloses a method for inhibiting the 
growth or activity of Clostridium species in a human food 
product by adding an effective amount of the beneficial 
microorganism, L. rhamnosus [L. casei subspecies rhamnosus], which 
produces predominantly L(+)-lactic acid. 

SUMM . . . viable cells of three specific microorganisms beneficial to the 
human intestinal microorganisms for preventing diarrhea. Specifically, 
the three microorganisms are Lactobacillus reuteri, 

Lactobacillus acidophilus and Bifidobacterium infantis. The diarrhea can 
be caused by antibiotic treatment or by infection with a virus, a. 
SUMM . . . "Bifidobacterium bifidum." (Fukushima et al., "Effect of a
probiotic formula on intestinal immunoglobulin A production in healthy
children." Int. J. Food Microb. 42:39-44 (1998)).

AN 2004:12970 USPATFULL

TI Polynucleotides, materials incorporating them, and methods for using
them
IN Glenn, Matthew, Whenuapai, NEW ZEALAND
Havukkala, Ilkka J., Remuera, NEW ZEALAND
Lubbers, Mark, Palmerston North, NEW ZEALAND
Dekker, James, Palmerston North, NEW ZEALAND
PA GENESIS RESEARCH AND DEVELOPMENT CORP. LTD., Parnell, NEW ZEALAND
(non-U.S. corporation)
PI US 20040009490 A1 20040115
US 7125698 B2 20061024
AI US 2002-264213 A1 20021003 (10)
RLI Continuation-in-part of Ser. No. US 2001-971536, filed on 2 Oct 2001,
PENDING Continuation-in-part of Ser. No. US 2000-634238, filed on 8 Aug
2000, GRANTED, Pat. No. US 6544772
PRAI US 1999-147853P 19990809 (60)
US 1999-147852P 19990809 (60)
US 1999-152032P 19990901 (60)
US 1999-152031P 19990901 (60)
DT Utility
FS APPLICATION
LN.CNT 5375
INCL INCLM: 435/006.000
INCLS: 435/069.100; 435/252.300; 435/320.100; 530/350.000; 536/023.200
NCL NCLM: 435/193.000; 435/006.000
NCLS: 426/534.000; 435/183.000; 435/194.000; 530/350.000; 435/069.100;
435/252.300; 435/320.100; 536/023.200
IC [7]
ICM C12Q001-68
ICS C07H021-04; C12P021-02; C12N001-21; C12N015-74; C07K014-335
IPCI C12Q0001-68 [ICM,7]; C07H0021-04 [ICS,7]; C07H0021-00 [ICS,7,C*
C12P0021-02 [ICS,7]; C12N0001-21 [ICS,7]; C12N0015-74 [ICS,7];
C07K0014-335 [ICS,7]; C07K0014-195 [ICS,7,C*]
IPCI-2 C12N0009-10 [I,A]
IPCR C12N0009-10 [I,C]; C12N0009-10 [I,A]; A23C0019-00 [I,C*];
A23C0019-032 [I,A]; A61K0038-00 [N,C*]; A61K0038-00 [N,A];
C07K0014-195 [I,C*]; C07K0014-335 [I,A]; C12N0001-21 [I,C*];
C12N0001-21 [I,A]
CAS INDEXING IS AVAILABLE FOR THIS PATENT.



L5 ANSWER 19 OF 27 USPATFULL on STN 
AN 2004:250212 USPATFULL 

TI Nucleic acid and amino acid sequences relating to Streptococcus 

pneumoniae for diagnostics and therapeutics 
IN Doucette-Stamm, Lynn A., Framingham, MA, United States 

Bush, David, Somerville, MA, United States 
PA Genome Therapeutics Corporation, Waltham, MA, United States (U.S. 
corporation) 
PI US 6800744 B1 20041005 

AI US 1998-107433 19980630 (9) 

PRAI US 1998-85131P 19980512 (60) 

US 1997-51553P 19970702 (60) 

DT Utility 
FS GRANTED 
LN.CNT 11545 

INCL INCLM: 536/023.100 

INCLS: 435/006.000; 435/320.100; 435/325.000; 435/254.000; 435/419.000; 
536/024.100; 536/023.400; 536/024.320 
NCL NCLM: 536/023.100 

NCLS: 435/006.000; 435/320.100; 435/325.000; 435/419.000; 536/023.400; 
536/024.100; 536/024.320 



ICM C12Q001-68 

ICS C12N001-14; C12N015-00; C12N005-00; C12N005-04; C07H021-02; 
C07H021-04 

IPCI C12Q0001-68 [ICM, 7]; C12N0001-14 [ICS, 7]; C12N0015-00 [ICS, 7]; 

C12N0005-00 [ICS, 7]; C12N0005-04 [ICS, 7]; C07H0021-02 [ICS, 7]; 

C07H0021-04 [ICS, 7]; C07H0021-00 [ICS,7,C*] 
IPCR C07H0021-00 [I,C*]; C07H0021-02 [I, A]; C07H0021-04 [I, A]; 

C07K0014-195 [I,C*]; C07K0014-315 [I, A] 



EXF 536/23.1; 536/23.4; 536/24.32; 435/320.1; 435/325; 435/254.11; 435/419; 
435/6 

CAS INDEXING IS AVAILABLE FOR THIS PATENT. 

AN     2003:71519 USPATFULL 

TI     Corynebacterium glutamicum genes encoding metabolic pathway proteins 
IN     Pompejus, Markus, Freinsheim, GERMANY, FEDERAL REPUBLIC OF 
       Kroger, Burkhard, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF 
       Schroder, Hartwig, Nussloch, GERMANY, FEDERAL REPUBLIC OF 
       Zelder, Oskar, Speyer, GERMANY, FEDERAL REPUBLIC OF 
       Haberhauer, Gregor, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF 
       Kim, Jun-Won, Seoul, KOREA, REPUBLIC OF 
       Lee, Heung-Shick, Seoul, KOREA, REPUBLIC OF 
       Hwang, Byung-Joon, Seoul, KOREA, REPUBLIC OF 
PI     US 20030049804 A1 20030313 
AI     US 2000-746660 A1 20001222 (9) 
RLI    Continuation-in-part of Ser. No. US 2000-606740, filed on 23 Jun 2000, 
       PENDING Continuation-in-part of Ser. No. US 2000-603124, filed on 23 Jun 
       2000, PENDING 
PRAI   DE 1999-19931420 19990708 
       US 1999-141031P 19990625 (60) 
       US 1999-142101P 19990702 (60) 
       US 1999-148613P 19990812 (60) 
       US 2000-187970P 20000309 (60) 
DT     Utility 
FS     APPLICATION 
LN.CNT 15004 
INCL   INCLM: 435/115.000 
       INCLS: 435/069.100; 435/252.300; 435/320.100; 435/183.000; 536/023.200 
NCL    NCLM: 435/115.000 
       NCLS: 435/069.100; 435/183.000; 435/252.300; 435/320.100; 536/023.200 
IC     [7] 
       ICM C12P013-08 
       ICS C07H021-04; C12N009-00; C12P021-02; C12N001-21 
IPCI   C12P0013-08 [ICM, 7]; C12P0013-00 [ICM, 7,C*]; C07H0021-04 [ICS, 7]; 
       C07H0021-00 [ICS,7,C*]; C12N0009-00 [ICS, 7]; C12P0021-02 [ICS, 7]; 
       C12N0001-21 [ICS, 7] 
IPCR   C07K0014-195 [I,C*]; C07K0014-34 [I, A]; C12N0001-21 [I,C*]; 
       C12N0001-21 [I, A]; C12N0009-00 [I,C*]; C12N0009-00 [I, A]; 
       C12N0009-18 [I,C*]; C12N0009-18 [I, A]; C12N0009-90 [I,C*]; 
       C12N0009-90 [I, A] 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 

AN     2003:95966 USPATFULL 

TI     Polynucleotides, materials incorporating them, and methods for using 
       them 
IN     Glenn, Matthew, Auckland, NEW ZEALAND 
       Havukkala, Ilkka J., Auckland, NEW ZEALAND 
       Bloksberg, Leonard N., Auckland, NEW ZEALAND 
       Lubbers, Mark W., Palmerston North, NEW ZEALAND 
       Dekker, James, Palmerston North, NEW ZEALAND 
       Christensson, Anna C, Lund, SWEDEN 
       Holland, Ross, Palmerston North, NEW ZEALAND 
       O'Toole, Paul W., Palmerston North, NEW ZEALAND 
       Reid, Julian R., Palmerston North, NEW ZEALAND 
       Coolbear, Timothy, Palmerston North, NEW ZEALAND 
PA     Genesis Research & Development Corp. Ltd, Parnell, NEW ZEALAND (non-U.S. 
       corporation) 
       ViaLactia Bioscience (NZ) Ltd., Auckland, NEW ZEALAND (non-U.S. 
       corporation) 
PI     US 6544772 B1 20030408 
AI     US 2000-634238 20000808 (9) 
DT     Utility 
FS     GRANTED 
LN.CNT 2015 
INCL   INCLM: 435/252.300 
       INCLS: 435/320.100; 536/023.700 
NCL    NCLM: 435/252.300 
       NCLS: 435/320.100; 536/023.700 
IC     [7] 
       ICM C12N001-21 
       ICS C12N015-63; C12N015-31 
IPCI   C12N0001-21 [ICM, 7]; C12N0015-63 [ICS, 7]; C12N0015-31 [ICS, 7] 
IPCR   A23C0019-00 [I,C*]; A23C0019-032 [I, A]; C07K0014-195 [I,C*]; 
       C07K0014-335 [I, A]; C12N0001-21 [I,C*]; C12N0001-21 [I, A]; 
       C12N0015-31 [I,C*]; C12N0015-31 [I, A] 
!.3; 435/320.1; 536/23.7 



CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



ANSWER 22 OF 27 BIOTECHDS COPYRIGHT 2008 THOMSON REUTERS on STN 
1999-00562 BIOTECHDS 

Use of lactic and propionic acid bacteria; 
   to bind mycotoxin to prevent their absorption or to purify 
   contaminated food or feedstuff 
El-Nezami H; Kankaanpaa P; Salminen S; Ahokas J 
Roy.Melbourne-Inst.Technol. 
Melbourne, Victoria, Australia. 
WO 9834503 13 Aug 1998 
WO 1998-AU63 6 Feb 1998 
AU 1997-5005 7 Feb 1997 
Patent 
English 

WPI: 1998-557001 [49] 

GENBANK® COPYRIGHT 2008 on STN 



LOCUS (LOC):             CP000705 GenBank (R) 
GenBank ACC. NO. (GBN):  CP000705 AAOV01000000 AAOV01000001-AAOV01000097 
GenBank VERSION (VER):   CP000705.1 GI:148530277 
SEQUENCE LENGTH (SQL):   1999618 
MOLECULE TYPE (CI):      DNA; circular 
DIVISION CODE (CI):      Bacteria 
DATE (DATE):             17 Oct 2007 
DEFINITION (DEF):        Lactobacillus reuteri F275, 
                         complete genome. 
SOURCE:                  Lactobacillus reuteri F275 
ORGANISM (ORGN):         Lactobacillus reuteri F275 
                         Bacteria; Firmicutes; Lactobacillales; 
                         Lactobacillaceae; Lactobacillus 
PROJECT (PJID):          GenomeProject: 15766 
COMMENT:                 URL -- http://www.jgi.doe.gov 
                         JGI Project ID: 4000135 
                         Source DNA available from Gerald Tannock 
                         (gerald.tannock@stonebow.otago.ac.nz) 
                         Bacteria available from DSMZ: DSM 20016 
                         Contacts: Gerald Tannock (gerald.tannock@stonebow.otago.ac.nz) 
                         Paul Richardson (microbes@cuba.jgi-psf.org) 
                         Quality assurance done by JGI-Stanford 
                         Annotation done by JGI-ORNL and JGI-PGF 
                         Finishing done by JGI-PGF 
                         Finished microbial genomes have been curated to close all gaps with 
                         greater than 98% coverage of at least two independent clones. Each 
                         base pair has a minimum q (quality) value of 30 and the total error 
                         rate is less than one per 50000. 
                         The JGI and collaborators endorse the principles for the 
                         distribution and use of large scale sequencing data adopted by the 
                         larger genome sequencing community and urge users of this data to 
                         follow them. It is our intention to publish the work of this 
                         project in a timely fashion and we welcome collaborative 
                         interaction on the project and analysis 
                         (http://www.genome.gov/page.cfm?pageID=10506376). 



REFERENCE:               1 (bases 1 to 1999618) 
AUTHOR (AU):             Copeland,A.; Lucas,S.; Lapidus,A.; Barry,K.; 
                         Detter,J.C.; Glavina del Rio,T.; Hammon,N.; Israni,S.; 
                         Dalin,E.; Tice,H.; Pitluck,S.; Goltsman,E.; Schmutz,J.; 
                         Larimer,F.; Land,M.; Hauser,L.; Kyrpides,N.; Kim,E.; 
                         Walter,J.; Heng,N.C.K.; Tannock,G.W.; Richardson,P. 
TITLE (TI):              Complete sequence of chromosome of 
                         Lactobacillus reuteri DSM 20016 
JOURNAL (SO):            Unpublished 
REFERENCE:               2 (bases 1 to 1999618) 
AUTHOR (AU):             Copeland,A.; Lucas,S.; Lapidus,A.; Barry,K.; 
                         Detter,J.C.; Glavina del Rio,T.; Hammon,N.; Israni,S.; 
                         Dalin,E.; Tice,H.; Pitluck,S.; Goltsman,E.; Schmutz,J.; 
                         Larimer,F.; Land,M.; Hauser,L.; Kyrpides,N.; Kim,E.; 
                         Walter,J.; Heng,N.C.K.; Tannock,G.W.; Richardson,P. 
TITLE (TI):              Direct Submission 
JOURNAL (SO):            Submitted (17-MAY-2007) US DOE Joint Genome Institute 
                         2800 Mitchell Drive B100, Walnut Creek, CA 94598-1698 
                         USA 



FEATURES (FEAT): 
Feature Key              Qualifier 
                         1..1999618 
                         /organism="Lactobacillus reuteri F275" 
                         /mol_type="genomic DNA" 
                         /strain="DSM 20016" 
                         /db_xref="taxon:299033" 
                         /note="F275 = DSM 20016 = JCM 1112" 

GENBANK® COPYRIGHT 2008 on STN 



LOCUS (LOC):             AM270397 GenBank (R) 
GenBank ACC. NO. (GBN):  AM270397 
GenBank VERSION (VER):   AM270397.1 GI:134084136 
CAS REGISTRY NO. (RN):   928607-03- 
SEQUENCE LENGTH (SQL):   163680 
MOLECULE TYPE (CI):      DNA; linear 
DIVISION CODE (CI):      Plants, fungi, algae 
DATE (DATE):             24 Mar 2007 
DEFINITION (DEF):        Aspergillus niger contig An18c0040, complete genome. 
SOURCE:                  Aspergillus niger 
ORGANISM (ORGN):         Aspergillus niger 
                         Eukaryota; Fungi; Ascomycota; Pezizomycotina; 
                         Eurotiomycetes; Eurotiales; Trichocomaceae; mitosporic 
                         Trichocomaceae; Aspergillus 
REFERENCE:               1 (bases 150631 to 153085) 



AUTHOR (AU):             Habison,A.; Kubicek,C.P.; Rohr,M. 
TITLE (TI):              Partial purification and regulatory properties of 
                         phosphofructokinase from Aspergillus niger 
JOURNAL (SO):            Biochem. J., 209 (3), 669-676 (1983) 
OTHER SOURCE (OS):       CA 99:171737 
REFERENCE:               2 (bases 71863 to 73356) 
AUTHOR (AU):             Manney,T.R. 
TITLE (TI):              Expression of the BAR1 gene in Saccharomyces 
                         cerevisiae: induction by the alpha mating pheromone of 
                         an activity associated with a secreted protein 
JOURNAL (SO):            J. Bacteriol., 155 (1), 291-301 (1983) 
OTHER SOURCE (OS):       CA 99:102121 
REFERENCE:               3 (bases 71863 to 73356) 
AUTHOR (AU):             Jones,E.W. 
TITLE (TI):              The synthesis and function of proteases in 
                         Saccharomyces: genetic approaches 
JOURNAL (SO):            Annu. Rev. Genet., 18, 233-270 (1984) 
OTHER SOURCE (OS):       CA 102:92538 
REFERENCE:               4 (bases 144032 to 145578) 
AUTHOR (AU):             Brandsch,R.; Bichler,V. 
TITLE (TI):              In vivo and in vitro expression of the 
                         6-hydroxy-D-nicotine oxidase gene of Arthrobacter 
                         oxidans, cloned into Escherichia coli, as an 
                         enzymatically active, covalently flavinylated 
                         polypeptide 
JOURNAL (SO):            FEBS Lett., 192 (2), 204-208 (1985) 
OTHER SOURCE (OS):       CA 104:29597 
REFERENCE:               5 (bases 150631 to 153085) 
AUTHOR (AU):             Schreferl,G.; Kubicek,C.P.; Rohr,M. 
TITLE (TI):              Inhibition of citric acid accumulation by manganese 
                         ions in Aspergillus niger mutants with reduced citrate 
                         control of phosphofructokinase 
JOURNAL (SO):            J. Bacteriol., 165 (3), 1019-1022 (1986) 
OTHER SOURCE (OS):       CA 104:165023 
REFERENCE:               6 (bases 156827 to 158638) 
AUTHOR (AU):             Whittington,H.A.; Grant,S.; Roberts,C.F.; Lamb,H.; 
                         Hawkins,A.R. 
TITLE (TI):              Identification and isolation of a putative permease 
                         gene in the quinic acid utilization (QUT) gene cluster 
                         of Aspergillus nidulans 
JOURNAL (SO):            Curr. Genet., 12 (2), 135-139 (1987) 
OTHER SOURCE (OS):       CA 107:91128 
REFERENCE:               7 (bases 54105 to 55762) 
AUTHOR (AU):             Chisholm,V.T.; Lea,H.Z.; Rai,R.; Cooper,T.G. 
TITLE (TI):              Regulation of allantoate transport in wild-type and 
                         mutant strains of Saccharomyces cerevisiae 
JOURNAL (SO):            J. Bacteriol., 169 (4), 1684-1690 (1987) 
OTHER SOURCE (OS):       CA 106:172689 
REFERENCE:               8 (bases 71863 to 73356) 
AUTHOR (AU):             MacKay,V.L.; Welch,S.K.; Insley,M.Y.; Manney,T.R.; 
                         Holly,J.; Saari,G.C.; Parker,M.L. 
TITLE (TI):              The Saccharomyces cerevisiae BAR1 gene encodes an 
                         exported protein with homology to pepsin 
JOURNAL (SO):            Proc. Natl. Acad. Sci. U.S.A., 85 (1), 55-59 (1988) 
OTHER SOURCE (OS):       CA 108:125658 
REFERENCE:               9 (bases 54105 to 55762) 
AUTHOR (AU):             Rai,R.; Genbauffe,F.S.; Cooper,T.G. 
TITLE (TI):              Structure and transcription of the allantoate permease 
                         gene (DAL5) from Saccharomyces cerevisiae 
JOURNAL (SO):            J. Bacteriol., 170 (1), 266-271 (1988) 
OTHER SOURCE (OS):       CA 109:17930 
REFERENCE:               10 



AUTHOR (AU):             Hawkins,A.R.; Lamb,H.K.; Smith,M.; Keyte,J.W.; 
                         Roberts,C.F. 
TITLE (TI):              Molecular organisation of the quinic acid utilization 
                         (QUT) gene cluster in Aspergillus nidulans 
JOURNAL (SO):            Mol. Gen. Genet., 214 (2), 224-231 (1988) 
OTHER SOURCE (OS):       CA 111:188522 
REFERENCE:               11 (bases 68231 to 69040) 
AUTHOR (AU):             Honjo,M.; Nakayama,A.; Fukazawa,K.; Kawamura,K.; 
                         Ando,K.; Furutani,Y. 
TITLE (TI):              A novel Bacillus subtilis gene involved in negative 
                         control of sporulation and degradative-enzyme 
                         production 
JOURNAL (SO):            J. Bacteriol., 172 (4), 1783-1790 (1990) 
OTHER SOURCE (OS):       CA 113:146166 
REFERENCE:               12 (bases 5212 to 7379) 
AUTHOR (AU):             Nikawa,J.; Hosaka,K.; Tsukagoshi,Y.; Yamashita,S. 
TITLE (TI):              Primary structure of the yeast choline transport gene 
                         and regulation of its expression 
JOURNAL (SO):            J. Biol. Chem., 265 (26), 15996-16003 (1990) 
OTHER SOURCE (OS):       CA 114:18549 
REFERENCE:               13 (bases 159975 to 161849) 
AUTHOR (AU):             Fling,M.E.; Kopf,J.; Tamarkin,A.; Gorman,J.A.; 
                         Smith,H.A.; Koltin,Y. 
TITLE (TI):              Analysis of a Candida albicans gene that encodes a 
                         novel mechanism for resistance to benomyl and 
                         methotrexate 
JOURNAL (SO):            Mol. Gen. Genet., 227 (2), 318-329 (1991) 
OTHER SOURCE (OS):       CA 116:100405 
REFERENCE:               14 (bases 28874 to 30837) 
AUTHOR (AU):             Masuda,N.; Kitamura,N.; Saito,K. 
TITLE (TI):              Primary structure of protein moiety of Penicillium 
                         notatum phospholipase B deduced from the cDNA 
JOURNAL (SO):            Eur. J. Biochem., 202 (3), 783-787 (1991) 
OTHER SOURCE (OS):       CA 118:76020 
REFERENCE:               15 (bases 100363 to 101496) 
AUTHOR (AU):             Sakoda,H.; Imanaka,T. 
TITLE (TI):              Cloning and sequencing of the gene coding for alcohol 
                         dehydrogenase of Bacillus stearothermophilus and 
                         rational shift of the optimum pH 
JOURNAL (SO):            J. Bacteriol., 174 (4), 1397-1402 (1992) 
OTHER SOURCE (OS):       CA 118:186708 
REFERENCE:               16 (bases 20773 to 22555) 
AUTHOR (AU):             Kamoda,S.; Saburi,Y. 
TITLE (TI):              Structural and enzymatical comparison of 
                         lignostilbene-alpha,beta-dioxygenase isozymes, I, II, 
                         and III, from Pseudomonas paucimobilis TMY1009 
JOURNAL (SO):            Biosci. Biotechnol. Biochem., 57 (6), 931-934 (1993) 
OTHER SOURCE (OS):       CA 119:154577 
REFERENCE:               17 (bases 80602 to 83154) 
AUTHOR (AU):             ElBerry,H.M.; Majumdar,M.L.; Cunningham,T.S.; 
                         Sumrada,R.A.; Cooper,T.G. 
TITLE (TI):              Regulation of the urea active transporter gene (DUR3) 
                         in Saccharomyces cerevisiae 
JOURNAL (SO):            J. Bacteriol., 175 (15), 4688-4698 (1993) 
OTHER SOURCE (OS):       CA 119:242524 
REFERENCE:               18 (bases 23096 to 24052) 
AUTHOR (AU):             Roper,D.I.; Cooper,R.A. 
TITLE (TI):              Purification, nucleotide sequence and some properties 
                         of a bifunctional isomerase/decarboxylase from the 
                         homoprotocatechuate degradative pathway of Escherichia 
                         coli C 
JOURNAL (SO):            Eur. J. Biochem., 217 (2), 575-580 (1993) 



OTHER SOURCE (OS):       CA 119:265270 
REFERENCE:               19 (bases 5212 to 7379) 
AUTHOR (AU):             Li,Z.; Brendel,M. 
TITLE (TI):              Co-regulation with genes of phospholipid biosynthesis 
                         of the CTR/HNM1-encoded choline/nitrogen mustard 
                         permease in Saccharomyces cerevisiae 
JOURNAL (SO):            Mol. Gen. Genet., 241 (5-6), 680-684 (1993) 
OTHER SOURCE (OS):       CA 121:28475 
REFERENCE:               20 (bases 159975 to 161849) 
AUTHOR (AU):             Ben-Yaacov,R.; Knoller,S.; Caldwell,G.A.; Becker,J.M.; 
                         Koltin,Y. 
TITLE (TI):              Candida albicans gene encoding resistance to benomyl 
                         and methotrexate is a multidrug resistance gene 
JOURNAL (SO):            Antimicrob. Agents Chemother., 38 (4), 648-652 (1994) 
OTHER SOURCE (OS):       CA 121:2236 
REFERENCE:               21 (bases 150631 to 153085) 
AUTHOR (AU):             Legisa,M.; Bencina,M. 
TITLE (TI):              Evidence for the activation of 6-phosphofructo-1-kinase 
                         by cAMP-dependent protein kinase in Aspergillus niger 
JOURNAL (SO):            FEMS Microbiol. Lett., 118 (3), 327-333 (1994) 
OTHER SOURCE (OS):       CA 121:102759 
REFERENCE:               22 (bases 138020 to 140061) 
AUTHOR (AU):             Kim,Y.J.; Bjorklund,S.; Li,Y.; Sayre,M.H.; 
                         Kornberg,R.D. 
TITLE (TI):              A multiprotein mediator of transcriptional activation 
                         and its interaction with the C-terminal repeat domain 
                         of RNA polymerase II 
JOURNAL (SO):            Cell, 77 (4), 599-608 (1994) 
OTHER SOURCE (OS):       CA 121:102149 
REFERENCE:               23 (bases 38805 to 39836) 
AUTHOR (AU):             Downs,D.M.; Petersen,L. 
TITLE (TI):              apbA, a new genetic locus involved in thiamine 
                         biosynthesis in Salmonella typhimurium 
JOURNAL (SO):            J. Bacteriol., 176 (16), 4858-4864 (1994) 
OTHER SOURCE (OS):       CA 121:197350 
REFERENCE:               24 (bases 5212 to 7379) 
AUTHOR (AU):             Li,Z.; Brendel,M. 
TITLE (TI):              Sensitivity to nitrogen mustard in Saccharomyces 
                         cerevisiae is independently determined by regulated 
                         choline permease and DNA repair 
JOURNAL (SO):            Mutat. Res., 315 (2), 139-145 (1994) 
OTHER SOURCE (OS):       CA 121:198201 
REFERENCE:               25 (bases 85096 to 90863) 
AUTHOR (AU):             Szczypka,M.S.; Wemmie,J.A.; Moye-Rowley,W.S.; 
                         Thiele,D.J. 
TITLE (TI):              A yeast metal resistance protein similar to human 
                         cystic fibrosis transmembrane conductance regulator 
                         (CFTR) and multidrug resistance-associated protein 
JOURNAL (SO):            J. Biol. Chem., 269 (36), 22853-22857 (1994) 
OTHER SOURCE (OS):       CA 122:26140 
REFERENCE:               26 (bases 91484 to 93088) 
AUTHOR (AU):             Knight,S.A.; Tamai,K.T.; Kosman,D.J.; Thiele,D.J. 
TITLE (TI):              Identification and analysis of a Saccharomyces 
                         cerevisiae copper homeostasis gene encoding a 
                         homeodomain protein 
JOURNAL (SO):            Mol. Cell. Biol., 14 (12), 7792-7804 (1994) 
OTHER SOURCE (OS):       CA 122:73697 
REFERENCE:               27 (bases 85096 to 90863) 
AUTHOR (AU):             Wemmie,J.A.; Szczypka,M.S.; Thiele,D.J.; 
                         Moye-Rowley,W.S. 
TITLE (TI):              Cadmium tolerance mediated by the yeast AP-1 protein 
                         requires the presence of an ATP-binding cassette 



                         transporter-encoding gene, YCF1 
JOURNAL (SO):            J. Biol. Chem., 269 (51), 32592-32597 (1994) 
OTHER SOURCE (OS):       CA 121:294343 
REFERENCE:               28 (bases 20773 to 22555) 
AUTHOR (AU):             Kamoda,S.; Saburi,Y. 
TITLE (TI):              Cloning of a lignostilbene-alpha,beta-dioxygenase 
                         isozyme gene from Pseudomonas paucimobilis TMY1009 
JOURNAL (SO):            Biosci. Biotechnol. Biochem., 59 (10), 1866-1868 (1995) 
OTHER SOURCE (OS):       CA 124:47043 
REFERENCE:               29 (bases 85096 to 90863) 
AUTHOR (AU):             Li,Z.S.; Szczypka,M.; Lu,Y.P.; Thiele,D.J.; Rea,P.A. 
TITLE (TI):              The yeast cadmium factor protein (YCF1) is a vacuolar 
                         glutathione S-conjugate pump 
JOURNAL (SO):            J. Biol. Chem., 271 (11), 6509-6517 (1996) 
OTHER SOURCE (OS):       CA 124:226108 
REFERENCE:               30 (bases 33419 to 35477) 
AUTHOR (AU):             Silva,J.C.; Minto,R.E.; Barry,C.E. III; Holland,K.A.; 
                         Townsend,C.A. 
TITLE (TI):              Isolation and characterization of the versicolorin B 
                         synthase gene from Aspergillus parasiticus. Expansion 
                         of the aflatoxin b1 biosynthetic gene cluster 
JOURNAL (SO):            J. Biol. Chem., 271 (23), 13600-13608 (1996) 
OTHER SOURCE (OS):       CA 125:50401 
REFERENCE:               31 (bases 15614 to 17392) 
AUTHOR (AU):             Covert,S.F.; Enkerli,J.; Miao,V.P.; VanEtten,H.D. 
TITLE (TI):              A gene for maackiain detoxification from a dispensable 
                         chromosome of Nectria haematococca 
JOURNAL (SO):            Mol. Gen. Genet., 251 (4), 397-406 (1996) 
OTHER SOURCE (OS):       CA 125:106677 
REFERENCE:               32 (bases 131646 to 132375) 
AUTHOR (AU):             Mitsuhashi,S.; Miyachi,S. 
TITLE (TI):              Amino acid sequence homology between N- and C-terminal 
                         halves of a carbonic anhydrase in Porphyridium 
                         purpureum, as deduced from the cloned cDNA 
JOURNAL (SO):            J. Biol. Chem., 271 (45), 28703-28709 (1996) 
OTHER SOURCE (OS):       CA 125:321312 
REFERENCE:               33 (bases 85096 to 90863) 
AUTHOR (AU):             Li,Z.S.; Lu,Y.P.; Zhen,R.G.; Szczypka,M.; Thiele,D.J.; 
                         Rea,P.A. 
TITLE (TI):              A new pathway for vacuolar cadmium sequestration in 
                         Saccharomyces cerevisiae: YCF1-catalyzed transport of 
                         bis(glutathionato)cadmium 
JOURNAL (SO):            Proc. Natl. Acad. Sci. U.S.A., 94 (1), 42-47 (1997) 
OTHER SOURCE (OS):       CA 126:127966 
REFERENCE:               34 (bases 1241 to 3043) 
AUTHOR (AU):             Saito,Y.; Ishii,Y.; Hayashi,H.; Imao,Y.; Akashi,T.; 
                         Yoshikawa,K.; Noguchi,Y.; Soeda,S.; Yoshida,M.; 
                         Niwa,M.; Hosoda,J.; Shimomura,K. 
TITLE (TI):              Cloning of genes coding for L-sorbose and L-sorbosone 
                         dehydrogenases from Gluconobacter oxydans and microbial 
                         production of 2-keto-L-gulonate, a precursor of 
                         L-ascorbic acid, in a recombinant G. oxydans strain 
JOURNAL (SO):            Appl. Environ. Microbiol., 63 (2), 454-460 (1997) 
OTHER SOURCE (OS):       CA 126:167180 
REFERENCE:               35 (bases 150631 to 153085) 
AUTHOR (AU):             Ruijter,G.J.; Panneman,H.; Visser,J. 
TITLE (TI):              Overexpression of phosphofructokinase and pyruvate 
                         kinase in citric acid-producing Aspergillus niger 
JOURNAL (SO):            Biochim. Biophys. Acta, 1334 (2-3), 317-326 (1997) 
OTHER SOURCE (OS):       CA 126:222695 
REFERENCE:               36 (bases 49850 to 51919) 
AUTHOR (AU):             Prieto,R.; Woloshuk,C.P. 



TITLE (TI):              ord1, an oxidoreductase gene responsible for conversion 
                         of O-methylsterigmatocystin to aflatoxin in Aspergillus 
                         flavus 
JOURNAL (SO):            Appl. Environ. Microbiol., 63 (5), 1661-1666 (1997) 
OTHER SOURCE (OS):       CA 127:13942 
REFERENCE:               37 (bases 17766 to 20491) 
AUTHOR (AU):             Todd,R.B.; Murphy,R.L.; Martin,H.M.; Sharp,J.A.; 
                         Davis,M.A.; Katz,M.E.; Hynes,M.J. 
TITLE (TI):              The acetate regulatory gene facB of Aspergillus 
                         nidulans encodes a Zn(II)2Cys6 transcriptional 
                         activator 
JOURNAL (SO):            Mol. Gen. Genet., 254 (5), 495-504 (1997) 
OTHER SOURCE (OS):       CA 127:105007 
REFERENCE:               38 (bases 71863 to 73356) 
AUTHOR (AU):             Ballensiefen,W.; Schmitt,H.D. 
TITLE (TI):              Periplasmic Bar1 protease of Saccharomyces cerevisiae 
                         is active before reaching its extracellular destination 
JOURNAL (SO):            Eur. J. Biochem., 247 (1), 142-147 (1997) 
OTHER SOURCE (OS):       CA 127:187926 
REFERENCE:               39 (bases 52148 to 53720) 
AUTHOR (AU):             Kuroyanagi,N.; Onogi,H.; Wakabayashi,T.; Hagiwara,M. 
TITLE (TI):              Novel SR-protein-specific kinase, SRPK2, disassembles 
                         nuclear speckles 
JOURNAL (SO):            Biochem. Biophys. Res. Commun., 242 (2), 357-364 (1998) 
OTHER SOURCE (OS):       CA 128:241040 
REFERENCE:               40 (bases 113565 to 114733) 
AUTHOR (AU):             MacDiarmid,C.W.; Gardner,R.C. 
TITLE (TI):              Overexpression of the Saccharomyces cerevisiae 
                         magnesium transport system confers resistance to 
                         aluminum ion 
JOURNAL (SO):            J. Biol. Chem., 273 (3), 1727-1732 (1998) 
OTHER SOURCE (OS):       CA 128:177021 
REFERENCE:               41 (bases 135940 to 137166) 
AUTHOR (AU):             Wang,H.Y.; Lin,W.; Dyck,J.A.; Yeakley,J.M.; 
                         Songyang,Z.; Cantley,L.C.; Fu,X.D. 
TITLE (TI):              SRPK2: a differentially expressed SR protein-specific 
                         kinase involved in mediating the interaction and 
                         localization of pre-mRNA splicing factors in mammalian 
                         cells 
JOURNAL (SO):            J. Cell Biol., 140 (4), 737-750 (1998) 
OTHER SOURCE (OS):       CA 128:306636 
REFERENCE:               42 (bases 38805 to 39836) 
AUTHOR (AU):             Frodyma,M.E.; Downs,D. 
TITLE (TI):              ApbA, the ketopantoate reductase enzyme of Salmonella 
                         typhimurium is required for the synthesis of thiamine 
                         via the alternative pyrimidine biosynthetic pathway 
JOURNAL (SO):            J. Biol. Chem., 273 (10), 5572-5576 (1998) 
OTHER SOURCE (OS):       CA 128:305459 
REFERENCE:               43 (bases 106231 to 106803) 
AUTHOR (AU):             Woloshuk,C.P.; Prieto,R. 
TITLE (TI):              Genetic organization and function of the aflatoxin B1 
                         biosynthetic genes 
JOURNAL (SO):            FEMS Microbiol. Lett., 160 (2), 169-176 (1998) 
OTHER SOURCE (OS):       CA 128:290649 
REFERENCE:               44 (bases 93571 to 94963) 
AUTHOR (AU):             Philp,N.J.; Yoon,H.; Grollman,E.F. 
TITLE (TI):              Monocarboxylate transporter MCT1 is located in the 
                         apical membrane and MCT3 in the basal membrane of rat 
                         RPE 
JOURNAL (SO):            Am. J. Physiol., 274 (6 PT 2), R1824-R1828 (1998) 
OTHER SOURCE (OS):       CA 129:159493 
REFERENCE:               45 (bases 93571 to 94963) 



AUTHOR (AU):             Wilson,M.C.; Jackson,V.N.; Heddle,C.; Price,N.T.; 
                         Pilegaard,H.; Juel,C.; Bonen,A.; Montgomery,I.; 
                         Hutter,O.F.; Halestrap,A.P. 
TITLE (TI):              Lactic acid efflux from white skeletal muscle is 
                         catalyzed by the monocarboxylate transporter isoform 
                         MCT3 
JOURNAL (SO):            J. Biol. Chem., 273 (26), 15920-15926 (1998) 
OTHER SOURCE (OS):       CA 129:159801 
REFERENCE:               46 (bases 153410 to 154508) 
AUTHOR (AU):             Dekkers,L.C.; van der Bij,A.J.; Mulders,I.H.; 
                         Phoelich,C.C.; Wentwoord,R.A.; Glandorf,D.C.; 
                         Wijffelman,C.A.; Lugtenberg,B.J. 
TITLE (TI):              Role of the O-antigen of lipopolysaccharide, and 
                         possible roles of growth rate and of NADH:ubiquinone 
                         oxidoreductase (nuo) in competitive tomato root-tip 
                         colonization by Pseudomonas fluorescens WCS365 
JOURNAL (SO):            Mol. Plant Microbe Interact., 11 (8), 763-771 (1998) 
OTHER SOURCE (OS):       CA 129:200426 
REFERENCE:               47 (bases 146433 to 148685) 
AUTHOR (AU):             Saito,K.; Yamazaki,H.; Ohnishi,Y.; Fujimoto,S.; 
                         Takahashi,E.; Horinouchi,S. 
TITLE (TI):              Production of trehalose synthase from a basidiomycete, 
                         Grifola frondosa, in Escherichia coli 
JOURNAL (SO):            Appl. Microbiol. Biotechnol., 50 (2), 193-198 (1998) 
OTHER SOURCE (OS):       CA 130:956 
REFERENCE:               48 (bases 122486 to 123895) 
AUTHOR (AU):             Kimura,M.; Matsumoto,G.; Shingu,Y.; Yoneyama,K.; 
                         Yamaguchi,I. 
TITLE (TI):              The mystery of the trichothecene 3-O-acetyltransferase 
                         gene. Analysis of the region around Tri101 and 
                         characterization of its homologue from Fusarium 
                         sporotrichioides 
JOURNAL (SO):            FEBS Lett., 435 (2-3), 163-168 (1998) 
OTHER SOURCE (OS):       CA 130:11163 
REFERENCE:               49 (bases 146433 to 148685) 
AUTHOR (AU):             Saito,K.; Kase,T.; Takahashi,E.; Horinouchi,S. 
TITLE (TI):              Purification and characterization of a trehalose 
                         synthase from the basidiomycete Grifola frondosa 
JOURNAL (SO):            Appl. Environ. Microbiol., 64 (11), 4340-4345 (1998) 
OTHER SOURCE (OS):       CA 130:34870 
REFERENCE:               50 (bases 14029 to 15172) 
AUTHOR (AU):             Feng,Y.; Khoo,H.E.; Poh,C.L. 
TITLE (TI):              Purification and characterization of gentisate 
                         1,2-dioxygenases from Pseudomonas alcaligenes NCIB 9867 
                         and Pseudomonas putida NCIB 9869 
JOURNAL (SO):            Appl. Environ. Microbiol., 65 (3), 946-950 (1999) 
OTHER SOURCE (OS):       CA 130:322223 
REFERENCE:               51 (bases 59070 to 60560) 
AUTHOR (AU):             Bosch,R.; Moore,E.R.; Garcia-Valdes,E.; Pieper,D.H. 
TITLE (TI):              NahW, a novel, inducible salicylate hydroxylase 
                         involved in mineralization of naphthalene by 
                         Pseudomonas stutzeri AN10 
JOURNAL (SO):            J. Bacteriol., 181 (8), 2315-2322 (1999) 
OTHER SOURCE (OS):       CA 131:70144 
REFERENCE:               52 
AUTHOR (AU):             DeZwaan,T.M.; Carroll,A.M.; Valent,B.; Sweigard,J.A. 
TITLE (TI):              Magnaporthe grisea pth11p is a novel plasma membrane 
                         protein that mediates appressorium differentiation in 
                         response to inductive substrate cues 
JOURNAL (SO):            Plant Cell, 11 (10), 2013-2030 (1999) 
OTHER SOURCE (OS):       CA 132:47358 
REFERENCE:               53 (bases 122486 to 123895) 



AUTHOR (AU):             McCormick,S.P.; Alexander,N.J.; Trapp,S.E.; Hohn,T.M. 
TITLE (TI):              Disruption of TRI101, the gene encoding trichothecene 
                         3-O-acetyltransferase, from Fusarium sporotrichioides 
JOURNAL (SO):            Appl. Environ. Microbiol., 65 (12), 5252-5256 (1999) 
OTHER SOURCE (OS):       CA 132:118174 
REFERENCE:               54 (bases 111907 to 113460) 
AUTHOR (AU):             Muraguchi,H.; Kamada,T. 
TITLE (TI):              A mutation in the eln2 gene encoding a cytochrome P450 
                         of Coprinus cinereus affects mushroom morphogenesis 
JOURNAL (SO):            Fungal Genet. Biol., 29 (1), 49-59 (2000) 
OTHER SOURCE (OS):       CA 134:96073 
REFERENCE:               55 (bases 108826 to 111209) 
AUTHOR (AU):             Yu,J.; Chang,P.K.; Bhatnagar,D.; Cleveland,T.E. 
TITLE (TI):              Genes encoding cytochrome P450 and monooxygenase 
                         enzymes define one end of the aflatoxin pathway gene 
                         cluster in Aspergillus parasiticus 
JOURNAL (SO):            Appl. Microbiol. Biotechnol., 53 (5), 583-590 (2000) 
OTHER SOURCE (OS):       CA 133:291807 
REFERENCE:               56 (bases 122486 to 123895) 
AUTHOR (AU):             Muhitch,M.J.; McCormick,S.P.; Alexander,N.J.; Hohn,T.M. 
TITLE (TI):              Transgenic expression of the TRI101 or PDR5 gene 
                         increases resistance of tobacco to the phytotoxic 
                         effects of the trichothecene 4,15-diacetoxyscirpenol 
JOURNAL (SO):            Plant Sci., 157 (2), 201-207 (2000) 
OTHER SOURCE (OS):       CA 133:306551 
REFERENCE:               57 (bases 107279 to 108340) 
AUTHOR (AU):             Cheng,Q.; Thomas,S.M.; Kostichka,K.; Valentine,J.R.; 
                         Nagarajan,V. 
TITLE (TI):              Genetic analysis of a gene cluster for cyclohexanol 
                         oxidation in Acinetobacter sp. Strain SE19 by in vitro 
                         transposition 
JOURNAL (SO):            J. Bacteriol., 182 (17), 4744-4751 (2000) 
OTHER SOURCE (OS):       CA 134:37832 
REFERENCE:               58 (bases 100363 to 101496) 
AUTHOR (AU):             Jornvall,H.; Hoog,J.O.; Persson,B.; Pares,X. 
TITLE (TI):              Pharmacogenetics of the alcohol dehydrogenase system 
JOURNAL (SO):            Pharmacology, 61 (3), 184-191 (2000) 
OTHER SOURCE (OS):       CA 134:38655 
REFERENCE:               59 
AUTHOR (AU):             Calabrese,D.; Bille,J.; Sanglard,D. 
TITLE (TI):              A novel multidrug efflux transporter gene of the major 
                         facilitator superfamily from Candida albicans (FLU1) 
                         conferring resistance to fluconazole 
JOURNAL (SO):            Microbiology (Reading, Engl.), 146 (PT 11), 2743-2754 
                         (2000) 
OTHER SOURCE (OS):       CA 135:1055 
REFERENCE:               60 
AUTHOR (AU):             Deising,H.B.; Werner,S.; Wernitz,M. 
TITLE (TI):              The role of fungal appressoria in plant infection 
JOURNAL (SO):            Microbes Infect., 2 (13), 1631-1641 (2000) 
REFERENCE:               61 (bases 43783 to 45994) 
AUTHOR (AU):             Smith,S. 
TITLE (TI):              The world according to PARP 
JOURNAL (SO):            Trends Biochem. Sci., 26 (3), 174-179 (2001) 
OTHER SOURCE (OS):       CA 135:15779 
REFERENCE:               62 (bases 131646 to 132375) 
AUTHOR (AU):             Cronk,J.D.; Endrizzi,J.A.; Cronk,M.R.; O'neill,J.W.; 
                         Zhang,K.Y. 
TITLE (TI):              Crystal structure of E. coli beta-carbonic anhydrase, 
                         an enzyme with an unusual pH-dependent activity 
JOURNAL (SO):            Protein Sci., 10 (5), 911-922 (2001) 
REFERENCE:               63 (bases 43783 to 45994) 



AUTHOR (AU) : 
TITLE (TI): 

JOURNAL (SO) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 
JOURNAL (SO) : 



Ziegler,M.; Oei,S.L. 

A cellular survival switch: poly (ADP-ribosyl ) ation 
stimulates DNA repair and silences transcription 
Bioessays, 23 (6), 543-548 (2001) 

64 (bases 43783 to 45994) 
Herceg,Z.; Wang,Z.Q. 

Functions of poly ( ADP-r ibose ) polymerase (PARP) in DNA 
repair, genomic integrity and cell death 
Mutat. Res., 477 (1-2), 97-110 (2001) 
OA 136:145570 

65 (bases 15614 to 17392) 

Mundodi, S .R. ; Watson, B.S.; Lopez-Meyer , M. ; Paiva,N.L. 
Functional expression and subcellular localization of 
the Nectria haematococca Makl phytoalexin 
detoxification enzyme in transgenic tobacco 
Plant Mol. Biol., 46 (4), 421-432 (2001) 
CA 135:285870 

66 (bases 1 to 163680) 

Pel,H.J.; de Winde,J.H.; Archer,D.B.; Dyer,P.S.;
Hofmann,G.; Schaap,P.J.; Turner,G.; de Vries,R.P.;
Albang,R.; Albermann,K.; Andersen,M.R.; Bendtsen,J.D.;
Benen,J.A.; van den Berg,M.; Breestraat,S.;
Caddick,M.X.; Contreras,R.; Cornell,M.; Coutinho,P.M.;
Danchin,E.G.; Debets,A.J.; Dekker,P.; van Dijck,P.W.;
van Dijk,A.; Dijkhuizen,L.; Driessen,A.J.; d'Enfert,C.;
Geysens,S.; Goosen,C.; Groot,G.S.; de Groot,P.W.;
Guillemette,T.; Henrissat,B.; Herweijer,M.; van den
Hombergh,J.P.; van den Hondel,C.A.; van der
Heijden,R.T.; van der Kaaij,R.M.; Klis,F.M.;
Kools,H.J.; Kubicek,C.P.; van Kuyk,P.A.; Lauber,J.;
Lu,X.; van der Maarel,M.J.; Meulenberg,R.; Menke,H.;
Mortimer,M.A.; Nielsen,J.; Oliver,S.G.; Olsthoorn,M.;
Pal,K.; van Peij,N.N.; Ram,A.F.; Rinas,U.; Roubos,J.A.;
Sagt,C.M.; Schmoll,M.; Sun,J.; Ussery,D.; Varga,J.;
Vervecken,W.; van de Vondervoort,P.J.; Wedler,H.;
Wosten,H.A.; Zeng,A.P.; van Ooyen,A.J.; Visser,J.;
Stam,H.

Genome sequencing and analysis of the versatile cell 
factory Aspergillus niger CBS 513.88 
Nat. Biotechnol., 25 (2), 221-231 (2007) 
CA 146:310276 

67 (bases 1 to 163680) 
Pel, H.J. 

Direct Submission 

Submitted (01-MAY-2006) Pel H.J., DSM, 624-0295, P.O.
Box 1, 2600 MA Delft, THE NETHERLANDS 



FEATURES (FEAT):
Feature Key

<41..>545
join(<41..325,
417..>545)
join(41..325,417..
/organism="Aspergillus niger"
/mol-type="genomic DNA"
/db-xref="taxon:5061"
/clone="An18c0040"
/locus-tag="An18g00920"
/locus-tag="An18g00920"

/locus-tag="An18g00920"
/codon-start=1

/product="hypothetical protein"
/protein-id="CAK47170.1"
/db-xref="GI:134084137"



/translation="MRGTRLSLRTPYTAKSHYTT 

TGAQKPLHGSLCEATSFVGCSSVW 

LRGISCSTQYKGKLNIHLNLTSGGRWTPIPAGF 

EHGQGRDLTDELANETLLAFSASP 

AASGCRILFFELDLSVLGSSTSVANAAYLQTRCL 

V" 

exon 41..325 /locus-tag="An18g00920"
/number=1
intron 326..416 /locus-tag="An18g00920"
/number=1
exon 417..545 /locus-tag="An18g00920"
/number=2
gene 874..954 /gene="tRNA-Arg (CCG)"
/locus-tag="An18e00930"
tRNA 874..954 /gene="tRNA-Arg (CCG)"
/locus-tag="An18e00930"
/product="tRNA-Arg"
/inference="profile:tRNAscan:1.4"
/note="codon recognized: CGG" 

gene complement(<1241..>3043) /locus-tag="An18g00940"
mRNA complement(join(<1241..1273,1325..1814, /locus-tag="An18g00940"
1867..2175,2229..2599,
2960..>3043))
CDS complement(join(1241..1273,1325..1814, /locus-tag="An18g00940"
1867..2175,2229..2599,
2960..3043))
/EC-number="1.1.99.12"
/inference="profile:COGS:COG2303"
/inference="profile:PFAM:PF05199"
/note="unnamed protein product;
Catalytic activity: L-sorbose +
acceptor <=> 5-dehydro-D-fructose
+ reduced acceptor. Cofactor: FAD.
Title: strong similarity to FAD
dependent L-sorbose dehydrogenase
SDH - Gluconobacter oxydans"
/citation=[34]
/codon-start=1
/protein-id="CAK47171.1"
/db-xref="GI:134084138"

/translation="MGFVAGFDSDLEWNLRSVPQ 

ERAFGGDTVSHATNSGKPRNYPLR 

EKSRRFHEFLGRKQADDMNSGQQLGYSEAFSSSY 

AGLRSFASSYPLGSNVTLWTKATV 

ERLIMAGRTAEGVAVARAVGDQIERVQVRARKEV 

ILSAGAYGSPKILLLSGIGPAAEL 

KRHNITPVADLPVGKNYADHPHIFTYWTIDECNA

TLGDGEMETDQVHWLAGLPYDWMS 

FAPADTKTQEIAKKILSPTERSRYLADGKMQLEN 

FIVYTAVDTSARKKPLEGWPGKRV 

ISLMHVLVDSVSRGTLTLRSSDPADAPVLDPKLL 

ASPVDRSALYENVRETARAIQAVE 

GLNAVEFGVDDSLRDNWSNEAIEVRANRSGGSTY 

HYSGTCAMGEVVDTECRVYQAERL 

RWDSGIIPLPLAAHYQAPTYGVAEQAADMILSS 

VA" 

exon complement(1241..1273) /locus-tag="An18g00940"
/number=1
intron complement(1274..1324) /locus-tag="An18g00940"
intron complement(1815..1866) /locus-tag="An18g00940"
/number=2
exon complement(1867..2175) /locus-tag="An18g00940"
/number=3
intron complement(2176..2228) /locus-tag="An18g00940"
/number=3
exon complement(2229..2599) /locus-tag="An18g00940"
/number=4
intron complement(2600..2959) /locus-tag="An18g00940"
/number=4
exon complement(2960..3043) /locus-tag="An18g00940"
/number=5
gene complement(<3788..>4775) /locus-tag="An18g00950"
mRNA complement(join(<3788.. /locus-tag="An18g00950"



exon 

intron 

intron 
exon 
gene 
mRNA 



4122,4201..4523,
4678..>4775))
complement(join(3788..4122,
4201..4523,
4678..4775))

complement(3788..4122)
complement(4123..4200)
complement(4201..4523)
complement(4524..4677)
complement(4678..4775)
complement(<5212..>7379)
complement(join(<5212..
5456,5493..5664,
5746..6075,6140..6159,
6225..6602,6674..6746,
6800..6899,
7249..>7379))
complement(join(5212..5



/locus-tag="An18g00950"
/inference="profile:COGS:COG0454"
/inference="profile:COGS:COG0456"
/inference="similar to AA
sequence:UniProtKB:AE010349.4"
/note="unnamed protein product;
Title: weak similarity to
hypothetical acetyltransferase
MK0549 - Methanopyrus kandle"
/codon-start=1
/protein-id="CAK47172.1"
/db-xref="GI:134084139"
/translation="MTKSQFEVREIHTKEEYARL
VDVLWSANFNPYNPVFTAVHPVSG
HTAEDRVKDKALDTEIRWAAHEKNPASHLIYVID
TETGRVAGGCEWLIFHKNPFPNGP
QPLSCTWYPEGSERAEYASRMLSQAFFPRMCWLQ
RPHAGVNAMGVHPDYRRRGVGRLL
MQWGHERIDPLGYESWIEGSPIGRWLYEESGYKR
VISLNIDFAKKNPSDEWNRLVHEC
QPPAILLLWRPPRGEWNDKVPPGPWAVTEDTWK"
/locus-tag="An18g00950"
/number=1
/locus-tag="An18g00950"
/number=1
/locus-tag="An18g00950"
/number=2
/locus-tag="An18g00950"
/number=2
/locus-tag="An18g00950"
/number=3
/locus-tag="An18g00960"
/locus-tag="An18g00960"

/locus-tag="An18g00960"



456,5493..5664,
5746..6075,6140..6159,
6225..6602,6674..6746,
6800..6899,7249..7379))



exon 
intron 

intron 

exon 

intron 



complement(5212..5456)
complement(5457..5492)
complement(5493..5664)
complement(5665..5745)
complement(5746..6075)
complement(6076..6139)



/inference="profile:COGS:COG0833"
/inference="similar to AA
sequence:PIR:S11175"
/note="unnamed protein product;
Function: Hnm1 of S. cerevisiae is
the unique choline permease in
yeast (Km of 0.5 micromolar).
Regulation: in S. cerevisiae,
expression of HNM1 gene is
regulated by the phospholipid
precursors inositol and choline
and this regulation involves the
trans-acting factors Ino2p, Ino4p
and Opi1p. Remark: alternate name
for S. cerevisiae Hnm1 is YGL077c.
Similarity: Hnm1 of S. cerevisiae
belongs to the APC family, which
encompasses amino acid permeases
more related to the mammalian
cationic amino-acid transporter
family (Tea/ecoR). Title: strong
similarity to choline permease
Hnm1 - Saccharomyces cerevisiae
plasma membrane"
/citation=[12]
/citation=[19]
/citation=[24]
/codon-start=1
/protein-id="CAK47173.1"
/db-xref="GI:134084140"

/translation="MARNSDVEHSIELRAPTNRD 

SGTPSKATFSDDAYELARVGKKEV

GGPAGLVYGFLGVWAGIISVFISMGELASMMPSA 

GGQYHWVSILAPRSARKFLSHVTG 

SVCIIAWTAAPTAAIYLAASVLQSTIAMNIPSYD 

PKGWHITLIMWAILLVCTVLNTWL 

GMILPVIEVLILLVHVLGFFAVLVPLVYLGPKAD 

PRSIFTVSFDYGGWGDLTLATFIG 

LKGTVAAFVGTDGAVHMAEEVANSSRWPRSMLL 

ALMINGATGFAILIAFLFTAGDLL 

KIVESSASYPFMYMLASSTGSKGAAWLSSMMAI 

LQACAGLAGISSGSRMLWSFSREQ 

AIPGWRWVRQVNQRTLVPFHSTLWWAAGLLSL 

INIGSAWLNIILSLVLEAFFASY

MISLTLLLYRGGVLNWGPFRVKGWLGTANNIFAI 

AYSIIMMFFGCWPPENHPAPKNIN 

YSIVIFAGVTLISIIYYVGWARKHYQGPLAEL"

/locus-tag="An18g00960"
/number=1
/locus-tag="An18g00960"
/number=1
/locus-tag="An18g00960"
/number=2
/locus-tag="An18g00960"
/number=2
/locus-tag="An18g00960"
/number=3
/locus-tag="An18g00960"



/number=3
exon complement(6140..6159) /locus-tag="An18g00960"
/number=4
intron complement(6160..6224) /locus-tag="An18g00960"
/number=4
exon complement(6225..6602) /locus-tag="An18g00960"
/number=5
intron complement(6603..6673) /locus-tag="An18g00960"
/number=5
exon complement(6674..6746) /locus-tag="An18g00960"
/number=6
intron complement(6747..6799) /locus-tag="An18g00960"
/number=6
exon complement(6800..6899) /locus-tag="An18g00960"
/number=7
intron complement(6900..7248) /locus-tag="An18g00960"
/number=7
exon complement(7249..7379) /locus-tag="An18g00960"
/number=8
gene complement(<7770..>8415) /locus-tag="An18g00970"
mRNA complement(join(<7770..8282,8380..>8415))
CDS complement(join(7770..8282,8380..8415))

exon complement(7770..8282)
intron complement(8283..8379)
exon complement(8380..8415)

<8621..>9700
join(<8621..8764,
8830..9156,9214..9356,
9406..>9700)
join(8621..8764,
8830..9156,9214..9356,
9406..9700)



/locus-tag="An18g00970"

/locus-tag="An18g00970"

/note="unnamed protein product; 

Title: weak similarity to 

hypothetical protein BAA96207.1 - 

Oryza sativa" 

/codon-start=1
/protein-id="CAK47174.1"
/db-xref="GI:134084141"

/translation="MLCTHAGGEGARSGRPDDRQ 

GHQPISVALWLCAPPPGFLSGRTS 

LASAGISNPDGATAVNTCETVGCLGLTACGQDCL 

SSRAALAELLLGLCCVGDGGGRAF 

PANPNKMDRYLLLYCRLGLTLDGPIWVWHTLS 

LSCSPPTQLHPPPLLASLHSFFPP 

ESLGRVPISIYAHMRHAAAKIV"

/locus-tag="An18g00970"
/number=1
/locus-tag="An18g00970"
/number=1
/locus-tag="An18g00970"
/number=2
/locus-tag="An18g00980"
/locus-tag="An18g00980"

/locus-tag="An18g00980"

/note="unnamed protein product; 
Function: M. grisea Pth11 is a
pathogenicity gene. Function: M.
grisea Pth11p is likely to be
involved in host surface
recognition. Function: M. grisea
pth11 mutants of strain 4091-5-8
are nonpathogenic due to a defect 
in appressorium differentiation. 
Localization: in M. grisea, a 



sig-peptide 
mat-peptide

exon 

intron 

exon 

intron 

exon 

intron 

exon 

gene 

mRNA 

CDS 



sig-peptide 



join(8699..8764,
8830..9156,9214..
9406..9697)
8621..8764
8765..8829
8830..9156
9157..9213
9214..9356
9357..9405
9406..9700
complement(<9787.
)
complement(<9787.
)
complement(9787..



Pth11-green fluorescent protein
fusion localised to the cell
membrane and vacuoles. Similarity:
similarity of the predicted A.
niger protein and M. grisea Pth11
is limited to the N-terminal half
of the protein sequences. Title:
weak similarity to integral
membrane protein PTH11 -
Magnaporthe grisea plasma
membrane"
/citation=[52]
/codon-start=1
/protein-id="CAK47175.1"
/db-xref="GI:134084142"

/translation="MTDHSAAVKWTCFLLIVSF 

IAWACLTTNWQVLRRKVSSVALL

LSTLIASIASGAAVSVAATHGLGQASPLTDAQW 

VMQKALYSMEVLYVLTLGLGKLSV 

MVLFYSLLSSTGQSKSVLAATGLLLIWVWMVIV 

VCLQCHPPEVWNIVGGTCLDLSGI 

WIAFGVMNVLVEIMIIAVPSFIIFRLKLSLKRRL 

WISCFGIRILDIAGSIVQLCYVR 

NFKIHADSPMPTNVWQWAICSQVLQTVAILSACV 

PYLREFLESFPSGMFKPTELKHPT 

VQSAYNATKCSDSDIELMRPESTKDT" 

/locus-tag="An18g00980"
/inference="protein
motif:SignalP:2.0"
/locus-tag="An18g00980"
/product="unnamed"
/locus-tag="An18g00980"
/number=1
/locus-tag="An18g00980"
/number=1
/locus-tag="An18g00980"
/number=2
/locus-tag="An18g00980"
/number=2
/locus-tag="An18g00980"
/number=3
/locus-tag="An18g00980"
/number=3
/locus-tag="An18g00980"
/number=4
>9927 /locus-tag="An18g00990"

complement(9877..9927)

.>9927 /locus-tag="An18g00990"

/locus-tag="An18g00990"
/note="unnamed protein product;
Title: strong similarity to EST
an-1359 - Aspergillus niger"
/codon-start=1
/protein-id="CAK47176.1"
/db-xref="GI:134084143"
/translation="MRAATLLLVALSAVALALPT
TNNVERAAADANVRQMDELTVAAISK"
/locus-tag="An18g00990"
/inference="protein



mat-peptide
exon 



exon 

intron 

exon 

intron 

exon 



complement(9790..9876)
complement(9787..9927)
<10500..>11479
join(<10500..10749,
10812..11041,
11105..>11479)
join(10500..10749,
10812..11041,
11105..11479)

10500..10749
10750..10811
10812..11041
11042..11104
11105..11479
<11980..>13464
join(<11980..12137,
12198..>13464)
join(11980..12137,
12198..13464)



motif:SignalP:2.0"
/locus-tag="An18g00990"
/product="unnamed"
/locus-tag="An18g00990"
/number=1
/locus-tag="An18g01000"
/locus-tag="An18g01000"

/locus-tag="An18g01000"



/note="unnamed protein product; 

Similarity: the nucleotide 

sequence of the ORF shows strong 

similarity to the ESTs an-2192 and 

an-2204 of A. niger. Title: strong 

similarity to EST an-2192 

-Aspergillus niger" 

/codon-start=1
/protein-id="CAK47177.1"
/db-xref="GI:134084144"

/translation="MYSYEKVNRDDDSDQGLLAE 

DSERVAPQPRVSRFPSWTNALLLL 

GLLFSLSLNVGWFVFQKGHNVPIQDLDPDAAMAK 

QRSPYTGLAWDTHKPYSHHSEYTS 

ENATHADEMWESLSMDPMVIAPTWEWAQSKGLSD 

SWAFPWDSNRRIYFIKVFHQLHCL 

KLMRHSYHELWSGQESSIPAPHIEHCLDSLRQDL 

MCKADDTPMPSLQLLNGGGEGQQM 

QCKDFDKLVAWSKAPERNACYKRLTDYKPIVHSI 

ERYAFCPEDSEHYPTMSKYFEEHGHYADPFSE"

/locus-tag="An18g01000"
/number=1
/locus-tag="An18g01000"
/number=1
/locus-tag="An18g01000"
/number=2
/locus-tag="An18g01000"
/number=2
/locus-tag="An18g01000"
/number=3
/locus-tag="An18g01010"
/locus-tag="An18g01010"

/locus-tag="An18g01010"

/inference="profile:COGS:COG0477"
/note="unnamed protein product; 
Function: FLU1 of C. albicans
facilitates resistance to
fluconazole and cycloheximide in
the fluconazole-hypersensitive S.
cerevisiae strain YKKB-13 lacking
the ABC (ATP-binding cassette)
transporter gene PDR5. Function:
FLU1 of C. albicans facilitates
resistance to mycophenolic acid in
C. albicans. Function: FLU1 of C.
albicans is involved in the
resistance to azole derivatives in
C. albicans. Function: MDR1 of C.



exon 

intron 

exon 

gene 

mRNA 

CDS 



11980..12137
12138..12197
12198..13464

complement(<14029..>15172)
complement(join(<14029..14318,14411..14845,
14917..>15172))
complement(join(14029..14318,14411..14845,
14917..15172))



albicans facilitates resistance to
the anti-mitotic drug benomyl and
to the dihydrofolate reductase
inhibitor methotrexate.
Similarity: N-terminus of the A.
niger protein is app. 120 aa
shorter than in FLU1 of C.
albicans. Similarity: the
predicted A. niger protein shows
strong similarity to fluconazole
resistance protein FLU1 of C.
albicans, which is a permease
belonging to the major facilitator
superfamily. Title: strong
similarity to fluconazole
resistance protein FLU1 - Candida
albicans"
/citation=[59]
/codon-start=1
/protein-id="CAK47178.1"
/db-xref="GI:134084145"

/translation="MDEEDGHCKETEVTVQSTSP 

VDSADYDPFDPQTRSTTYKWITW 

LVAGLSTMVQLSTIIAAPVSPSILAHFHSDNALY 

RTLIVSIWELGEIVAPLLWGPLSE 

L \ILFFVAFLAGTAASTSIQMLIAF 

RFL ..^.-^--T.-^-^^AIGPGIVSDLFPE 

ESRGRAMSIMSLTGALGPWGPIIGSYLGEKAGW 

RWAFWLPTIATGTLSLLILWYRE 

TYSVTLQKRKARQQNPESGSKSPEDTDKTASQVF 

FKAILRPLRLLIRSPMLILVTFYL 

SWYGYTYLVMTTIAPLFQDVYGFSEGASGLAFL 

GLCLGLILGAFLCSFLLDRYVRTA 

RARSGTSKPEQRLPPVLIACFVMSGGLFLFGWTA 

QYHVQWIAPIIGTGIIGFGLVSTT 

ITLQTYWDLFGIYAASATSAMLVPRNACAAFLP 

LAGPPLFDRLGYNWGGTLLALIVL 

VFSLMPLIFINYGERLRGKNLLDD" 

/locus-tag="An18g01010"
/number=1
/locus-tag="An18g01010"
/number=1
/locus-tag="An18g01010"
/number=2
/locus-tag="An18g01020"
/locus-tag="An18g01020"

/locus-tag="An18g01020"

/EC-number="1.13.11.4"
/inference="profile:COGS:COG3435"
/inference="similar to AA
sequence:UniProtKB:AF173167.3"
/note="unnamed protein product; 
Complex: the estimated molecular 
mass of the purified gentisate 1, 
2-dioxygenase of P. alcaligenes 
was 154 kDa, with a subunit mass 
of 39 kDa. its structure is 



deduced to be a tetramer. Remark: 
gentisate 1, 2-dioxygenase of P. 
alcaligenes exhibits typical 
saturation kinetics and has an 
apparent Km of 92 microM for 
gentisate. this enzyme has broad 
substrate specificities towards 
alkyl and halogenated gentisate 
analogs. Similarity: the predicted 
A. niger protein shows strong 
similarity to gentisate 
1, 2-dioxygenase of P. alcaligenes 
and conserved hypothetical 
proteins from other procaryotic 
species. Title: strong similarity 
to gentisate 1,2-dioxygenases xlnE
- Pseudomonas alcaligenes"
/citation=[50]
/codon-start=1
/protein-id="CAK47179.1"
/db-xref="GI:134084146"
/translation="MSPSACNTGSNGADSAEQLL 
QDLKKSNTLPLWTQMTRLNPPEPN 
PTAVPFIWRYDSIRPNLLRAGELVTEKQAERRVL 
MLVNPARDAPYTTDTLYAGLQLVM 
PNETAPAHRHTAFAMRYIIEGNGGFTAVHGKRIK
MQKGDVILTPTWNYHDHGKDGTGP 
MIWLDGLDLPNFRHFPVHFVDHYDQPRYPAEDVD 
SATSPIVFPWDKMKAELDKAPGTW 
AVRRYLRADGSEGGSAERVDAGTSSLPRQETTSA 
VYHVIAGSGCSEIGDKTLVWETGD 
TFCVPSWYKYRHIAAEGETVYLYRFDDKPMISAL 
GFYRSIDTDLATLVSQ" 
complement(14029..14318) /locus-tag="An18g01020"
/number=1
complement(14319..14410) /locus-tag="An18g01020"
/number=1
complement(14411..14845) /locus-tag="An18g01020"
/number=2
complement(14846..14916) /locus-tag="An18g01020"
/number=2
complement(14917..15172) /locus-tag="An18g01020"

<15614..>17392
join(<15614..15661,
15723..16281,
16347..16503,
16584..16739,
16803..16942,
17034..>17392)
join(15614..15661,
15723..16281,
16347..16503,
16584..16739,
16803..16942,
17034..17392)

/number=3
/locus-tag="An18g01030"
/locus-tag="An18g01030"

/locus-tag="An18g01030"

/inference="profile:COGS:COG0654"



exon 

intron 

exon 

intron 

exon 

intron 

exon 



15614..15661
15662..15722
15723..16281
16282..16346
16347..16503
16504..16583
16584..16739

/inference="profile:PFAM:PF01360"
/note="unnamed protein product;
Function: Mak1 from N.
haematococca specifically
hydroxylates the phytoalexins
medicarpin and
maackiain, converting them to less
fungitoxic derivatives.
Localization: heterologous
expression of the Mak1 cDNA
construct in plants indicated that
Mak1 protein accumulates in the
plant cytoplasm, associated with
endoplasmic reticulum membranes.
Similarity: the predicted A. niger
protein shows strong similarity to
Mak1 from the fungal pathogen N.
haematococca. Mak1 belongs to the
flavin-containing mono-oxygenases.
best matches are with putative
salicylate hydroxylases of several
procaryotic species. Title: strong
similarity to maackiain
detoxification protein 1 MAK1 -
Nectria haematococca endoplasmatic
reticulum"
/citation=[31]
/citation=[65]
/codon-start=1
/protein-id="CAK47180.1"
/db-xref="GI:134084147"

/translation="MASTRMSAQSTIADLRVPDY 

PSNCLQFTQQLTASATHTLPQKDA 

SVRLNIILAGAGLGGLATAIALAQAGHAVKIYEQ 

TPVLGEVGAGIQIPSNSTRILFSL 

GLQSYLEPYVTAPESISFRRWQNGKVIGLTKLIP 

NFVNNFKAPYYVIHRADFHSALCQ 

KALDVGVEIELGAKWDYDPIVGSITLADGTKHS

ADLIVAADGIKSVARNWLQGDEM 

RFQGPGFAAYRAWDVGKMRRDPDLSWILEKPAL 

NIWIGDSRHVMTYTIGAGKAFNMV 

LSHPEMTDPGTWKPETALEDMKAEFQGWDPILSK 

IIGMVEKTVKWPLLTGTLLQNWTV

GKLVILGDAAHAMVPYMSQGAAMAVEDGIALSRS 

LSHMTSRDQLQKALSIFQEVRKKR

AGHMQEASLLNGKLWHFPDGSLQQARDEAMAPEV

QGIPFSHSPNQWSDPATQMWCYGY 

DAEEAIDIAWMESLEARVDCVH"

/locus-tag="An18g01030"
/number=1
/locus-tag="An18g01030"
/number=1
/locus-tag="An18g01030"
/number=2
/locus-tag="An18g01030"
/number=2
/locus-tag="An18g01030"
/number=3
/locus-tag="An18g01030"
/number=3
/locus-tag="An18g01030"
/number=4



intron 16740..16802 /locus-tag="An18g01030"
/number=4
exon 16803..16942 /locus-tag="An18g01030"
/number=5
intron 16943..17033 /locus-tag="An18g01030"
/number=5
exon 17034..17392 /locus-tag="An18g01030"
/number=6
gene <17766..>20491 /locus-tag="An18g01040"
mRNA join(<17766..17864, /locus-tag="An18g01040"
17939..18039,
18108..18343,
18398..19163,
19373..19935,
20037..20252,
20325..>20491)
CDS join(17766..17864, /locus-tag="An18g01040"
17939..18039,
18108..18343,
18398..19163,
19373..19935,
20037..20252,
20325..20491)









/inference="profile:PFAM:PF00172"
/inference="similar to AA
sequence:UniProtKB:ENU56097.1"
/note="unnamed protein product; 
Phenotype: facB recessive loss of 
function mutants in A. nidulans 
are deficient in acetate induction 
of acetyl-CoA synthase, isocitrate 
lyase, malate 
synthase, acetamidase, and 
NADP-isocitrate dehydrogenase. 
Similarity: the predicted A. niger 
protein shows similarity to 
acetate regulatory DNA binding 
protein FacB (facB) from A. 
nidulans, which belongs to the 
GAL4-type zinc cluster 
transcriptional activators. Title: 
similarity to acetate regulatory 
DNA binding protein facB - 
Aspergillus nidulans" 
/citation=[37]
/codon-start=1
/protein-id="CAK47181.1"
/db-xref="GI:134084148"
/translation="MDFAPELPTPQGQSVKRRRV 
ALACDACRTRKSRCDGRRPKCGMC 
EDLGFECAYTPSTTTTNIIVQKDYLRDLEERVKS 
LETSMATVKSDLSGFASQINGSMP 
DAPRKENPLSNQSERFSDFAGTEDTIDAMGAVAF
ADEEDCGFFGPSSNIAFLRHLSCA 
VAHSASAQKEITSPPLDRVAYDGGFVSATRPSSP 
SHDQRPESLEGEKFDKFALPPPEE 
SLALIHRYFADTGLLFPYIHPPTFFETYAELKDN 
SKRVRRTWLGLLNIILAMAKLTAV
SGTTSAETGISESAIYYHRALSLCKGEILRGTTL 
EVGRYSIYAPVRARVPVYLRATKI
NTVQYLLVMGQYLQGTQKSVQAWTIHGLAVKAAL 
QLGLHSKGASRAFTPLEQEVRKRT 



exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

gene 

mRNA 



17766..17864
17865..17938
17939..18039
18040..18107
18108..18343
18344..18397
18398..19163
19164..19372
19373..19935
19936..20036
20037..20252
20253..20324
20325..20491

complement(<20773..>22555)
complement(join(<20773..21444,21501..21638,
21710..22119,
22180..22303,
22370..>22555))
complement(join(20773..21444,21501..21638,
21710..22119,
22180..22303,
22370..22555))



WFGCWLDRVRANLFLLNRKTIQQANIRVSTLYK

QIANIIDRIYGQNLGCDSPLSIGE 

TVGRVLGIENQLFSWVLGLPESLRQVTVQSMREE 

IERSEVGDEGHHKLYPLKFRIILT

LRYFHVQILLHRPILVKFLDATGPSGLEADEVKL 

LNDIGYSSMNKCVDSAMGIIDIIH 

ELVSTTGWQKDLLGAWWYSLYYTFNAALVIIGAM 

WVQRNRDTSEDPVDNGGNNNNMDI 

YLSRAVLTLQNLNVGNRMVDRCKYYLDQLITLLG 

LQADRIAETGIISGMNNPVAGNST

TGFDFPSLGIECGEFMIDDFFPGFARGSGFERW" 

/locus-tag="An18g01040"
/number=1
/locus-tag="An18g01040"
/number=1
/locus-tag="An18g01040"
/number=2
/locus-tag="An18g01040"
/number=2
/locus-tag="An18g01040"
/number=3
/locus-tag="An18g01040"
/number=3
/locus-tag="An18g01040"
/number=4
/locus-tag="An18g01040"
/number=4
/locus-tag="An18g01040"
/number=5
/locus-tag="An18g01040"
/number=5
/locus-tag="An18g01040"
/number=6
/locus-tag="An18g01040"
/number=6
/locus-tag="An18g01040"
/number=7
/locus-tag="An18g01050"
/locus-tag="An18g01050"

/locus-tag="An18g01050"

/EC-number="1.13.11.43"
/inference="profile:COGS:COG3670"
/inference="profile:PFAM:PF03055"
/inference="similar to AA
sequence:PIR:JC4324"
/note="unnamed protein product; 
Catalytic activity:
lignostilbene-alpha,beta-dioxygenase
isozymes (LSD-I, II, and III)
of P. paucimobilis catalyse the
reaction
1,2-bis(4-hydroxy-3-methoxyphenyl)ethylene
+ O(2) = 2 vanillin.
Cofactor:
lignostilbene-alpha,beta-dioxygenase
isozymes (LSD-I, II, and III)
of P. paucimobilis require iron as
a cofactor. Complex: LSD-I, II,
and III consist of alpha
alpha, alpha beta, and beta beta
subunits, respectively, they show
different specificities for
several substrates that are
stilbene and styrene derivatives.
Function: LSD of P. paucimobilis
catalyses the oxidative cleavage
of the interphenyl double bond in
the synthetic substrate and
lignin-derived stilbenes. it is
responsible for the degradation of
a diarylpropane-type structure in
lignin. Similarity: the predicted
A. niger protein shows strong
similarity to subunit lsdB of
lignostilbene-alpha,beta-dioxygenase
isozyme LSD-III of P.
paucimobilis and related
dioxygenases from several plant
species. Title: strong similarity
to subunit lsdB of
lignostilbene-alpha,beta-dioxygenase
isozyme LSD-III - Pseudomonas
paucimobilis"
/citation=[16]
/citation=[28]
/codon-start=1
/protein-id="CAK47182.1"
/db-xref="GI:134084149"

/translation="MTETRTTAFPARPQFSGFMK 

PCRVEGETLNLEVLGDLPTDIDGV 

FYRVMPDPQLPPFIENDPWFNGDGNITAFRFHDG 

RVSFQQRYVRTEKFVRERQAQRAL 

IAQLTLFLWSIEGKYRNKFTDAVEFKVRSTANTN

IVYFNGQLLALKEDSPPYAMDPVT 

LETRGLYDFDGQLPSMTFTAHPKFDPETGEMICF 

GYEARGDGTPDVCYYTVGPDGKFT 

EWWLVAPWAMIHDFAVTDNWVIFPIIPQLCDI 

ERMKQGGEHWQWSPETPLYLGVIP 

RRGAKTADVKWFQYKNSFPGHTANAHEDSSGNLI 

IDLGLSEKNVFFWWPDAQGNAPEP 

SSIRSQLVRFTINPRAEDLNLTEPRILQADNSEF 

YRIDDRYATKPHRHVFFDMMDPSL 

GTDFARIAPVLGGGYPLYNSLARFDNVTGETEVY 

FPGKTHMVQEPVFIPRSELAAEGD 

GYLLALVNNYDSMSSELHLLDTADFTRARAKILL 

PVRLRPGLHGSWVDGSDVPLSAARL"



exon complement(20773..21444) /locus-tag="An18g01050"
/number=1
intron complement(21445..21500) /locus-tag="An18g01050"
/number=1
exon complement(21501..21638) /locus-tag="An18g01050"
/number=2
complement(21639..21709) /locus-tag="An18g01050"
/number=2
complement(21710..22119) /locus-tag="An18g01050"
/number=3
complement(22120..22179) /locus-tag="An18g01050"
/number=3
complement(22180..22303) /locus-tag="An18g01050"
/number=4
complement(22304..22369) /locus-tag="An18g01050"
/number=4
complement(22370..22555) /locus-tag="An18g01050"
/number=5

/locus-tag="An18g01060"
/locus-tag="An18g01060"

<23096..>24052
join(<23096..23375,
23443..23571,
23667..>24052)
join(23096..23375,
23443..23571,
23667..24052)



/locus-tag="An18

/EC-number="5.3.3.10"
/inference="profile:COGS:COG0179"
/inference="profile:PFAM:PF01557"
/note="unnamed protein product;
Catalytic activity:
5-carboxymethyl-2-hydroxymuconate
delta-isomerase from E. coli
catalyses the reaction
5-carboxymethyl-2-hydroxymuconate
= 5-carboxy-2-oxohept-3-enedioate.
Pathway:
5-carboxymethyl-2-hydroxymuconate
delta-isomerase from E. coli is
involved in tyrosine and
phenylalanine metabolism.
Similarity: the predicted A. niger
protein shows strong similarity to
5-carboxymethyl-2-hydroxymuconate
delta-isomerase from E. coli and
many putative
2-hydroxyhepta-2,4-diene-1,
7-dioate isomerases from several
procaryotic and eucaryotic
organisms. Title: strong
similarity to
5-carboxymethyl-2-hydroxymuconate
delta-isomerase - Escherichia coli
cytoplasm"
/citation=[18]
/codon-start=1
/protein-id="CAK47183.1"
/db-xref="GI:134084150"

/translation="MSLPWKRLIRFQASDGRILR 

GEPIIPAAGETDLGLITEVDGLQA 

KLLSGSDIYDVTGKTQVTDEIVTVKRILSPLAQS 

DVPIMRCVGLNYAKHIREAGRKPP

PFPFIFFKPNTCVHDHGEPWIPRIAQNDQADYE 



intron 
exon 
intron 
exon 



23096..23375
23376..23442
23443..23571
23572..23666
23667..24052

<24984..>27426
join(<24984..25058,
25166..25301,
25564..25745,
25812..26003,
26080..26144,
26206..26646,
26722..27303,
27423..>27426)
join(24984..25058,
25166..25301,
25564..25745,
25812..26003,
26080..26144,
26206..26646,
26722..27303,
27423..27426)



VIGKDAKDVPVESALDYIAAYTVG 

NDISSRKLQRDPAYAGWPQWGFSKGFDTFAPLG 

PVLVAASEIPDPSQLQLTTIVDGE 

VRQQESVGDLLFDCRYLISYLSQGTTLQKGSVIM 

TGTPGGKGKFYC" 

/locus-tag="An18g01060"
/number=1
/locus-tag="An18g01060"
/number=1
/locus-tag="An18g01060"
/number=2
/locus-tag="An18g01060"
/number=2
/locus-tag="An18g01060"
/number=3
/locus-tag="An18g01070"
/locus-tag="An18g01070"

/locus-tag="An18

/inference="profile:COGS:COG0477"
/note="unnamed protein product;
Function: FLU1 of C. albicans
facilitates resistance to
fluconazole and cycloheximide in
the fluconazole-hypersensitive S.
cerevisiae strain YKKB-13 lacking
the ABC (ATP-binding cassette)
transporter gene PDR5. Function:
FLU1 of C. albicans facilitates
resistance to mycophenolic acid in
C. albicans. Function: FLU1 of C.
albicans is involved in the
resistance to azole derivatives in
C. albicans. Function: MDR1 of C.
albicans facilitates resistance to
the anti-mitotic drug benomyl and
to the dihydrofolate reductase
inhibitor methotrexate.
Similarity: N-terminus of the A.
niger protein is app. 120 aa
shorter than in FLU1 of C.
albicans. Similarity: the
predicted A. niger protein shows
strong similarity to fluconazole
resistance protein FLU1 of C.
albicans, which is a permease
belonging to the major facilitator
superfamily. Title: similarity to
fluconazole resistance protein



exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

gene 

mRNA 



24984..25058
25059..25165
25166..25301
25302..25563
25564..25745
25746..25811
25812..26003
26004..26079
26080..26144
26145..26205
26206..26646
26647..26721
26722..27303
27304..27422
27423..27426

complement(<27797..>28667)
complement(join(<27797..28220,28259..28342,
28450..28550,
28602..>28667))
complement(join(27797..



FLU1 - Candida albicans"
/citation=[59]
/codon-start=1
/protein-id="CAK47184.1"
/db-xref="GI:134084151"

/translation="MRGVEELASENKDETRSGRR 

NLGGKQHFRGGGERSVDLALRPPR 

RGTPGSPQSQLAHGVSCSSWLVPRVRIPTDDPGD 

PLRWPSWLKLSVILSTSLVNFVSN 

MGGAGLSVAVPVLMQQLQRSQAEVTQLLTLNFLF 

LGIGNIFWVPLAVKFGKRASIISS

MLLQAGALVWCAVATGYDSLLGARCVLGFAAAAG 

ESIVPEIVADIFFVHERATMMAIY

WLISGGSAVGPLIGGFMVEYASGTWRSYMWLCF 

ALASFNIVLLVFLFPESTFKRPEP 

SQAQELQLVAEHKETTSSFIENASAAAPEDSYTT 

NTPPFKGVLRPFHYNKDASFFKAV 

ISPLKLLVHPSVCWGIFTYGISLSPQIIMIFNFS 

ALLEASPYFFCSEYVGLMEIGAIV 

GFLLACYGGGYLSDIVNRMIAQRSSSSSIRPEQR 

LISLIPGMAIGPAGCILLAFACGH 

KLHWAAIAVGFGMVSFGTVYTPNIALTYIVHLHQ 

QDAANCLVSISLVKNLVAFTFLYV

AVDWVEREGYVQVFMIMFMLNVLALLCAVPLYFL 

GRRKDLNVYNGMEGRA" 

/locus-tag="An18g01070"
/number=1
/locus-tag="An18g01070"
/number=1
/locus-tag="An18g01070"
/number=2
/locus-tag="An18g01070"
/number=2
/locus-tag="An18g01070"
/number=3
/locus-tag="An18g01070"
/number=3
/locus-tag="An18g01070"
/number=4
/locus-tag="An18g01070"
/number=4
/locus-tag="An18g01070"
/number=5
/locus-tag="An18g01070"
/number=5
/locus-tag="An18g01070"
/number=6
/locus-tag="An18g01070"
/number=6
/locus-tag="An18g01070"
/number=7
/locus-tag="An18g01070"
/number=7
/locus-tag="An18g01070"
/number=8
/locus-tag="An18g01080"
/locus-tag="An18g01080"

/locus-tag="An18g01080"



28220,28259..28342,
28450..28550,
28602..28667))

complement(join(28506..
28550,28602..28667))

mat-peptide complement(join(27800..
28220,28259..28342,
28450..28505))
exon complement(27797..28220)
intron complement(28221..28258)
exon complement(28259..28342)
intron complement(28343..28449)
exon complement(28450..28550)
intron complement(28551..28601)
exon complement(28602..28667)
gene <28874..>30837
mRNA join(<28874..29683,
29736..29875,
29925..>30837)
CDS join(28874..29683,
29736..29875,
29925..30837)



/codon-start=l 

/product=" hypothetical protein" 
/protein-id="CAK47185 . 1" 
/db-xref="GI: 13 40 8 4152" 
/translation="MGPRKAFLNGYIWVRRIQP 
WRAL AAQAVP LIHTSNADSSPATG 
FYSLRFWSHPHPPTVPANHTVSGTNTSFDRHVI 
LVSNNSQWIINPWSIGRTLSWPGP 
VKSLEAGRYGHLRKHNPAAYEELNFASSNKHCCC 
VMLRAYYYPRVQEEYLLRLFLYRC 
LKDTASFATVRKKDQSLNVHESLQSTYPGGIFRV 
AAKADFPAEGSKSFRTQTMCWALL LKYGVF" 
/locus-tag="Anl8g01080" 

/inference="protein motif:SignalP:2.0"
/locus-tag="Anl8g01080"

/product="hypothetical protein"
/locus-tag="Anl8g01080"

/number=1

/locus-tag="Anl8g01080"
/number=1

/locus-tag="Anl8g01080" 

/number=2 

/locus-tag="Anl8g01080" 
/number=2 

/locus-tag="Anl8g01080" 
/number=3 

/locus-tag="Anl8g01080" 
/number=3 

/locus-tag="Anl8g01080" 
/number=4 

/locus-tag="Anl8g01090" 
/locus-tag="Anl8g01090" 



/locus-tag="Anl8 



/EC-number="3.1.1.5"
/inference="profile:PFAM:PF01735"
/note="unnamed protein product;
Catalytic activity:
lysophospholipase catalyses the
reaction 2-lysophosphatidylcholine
+ H(2)O = glycerophosphocholine +
a fatty acid anion. Pathway:
lysophospholipase is involved in
phospholipid degradation.
Similarity: the predicted A. niger



28874. .28921 



join(28922. .29683, 
29736. .29875, 
29925. .30834) 

28874. .29683 

29684. .29735 

29736. .29875 

29876. .29924 

29925. .30837 



protein shows strong similarity to
the protein sequences
lysophospholipase (sequence 5 and
sequence 7) of patents
WO0127251-A/5 and WO0127251-A/7
from A. oryzae (AC# AX112082 and
AC# AX112084) and to other fungal
lysophospholipases. Title: strong
similarity to lysophospholipase
from patent WO0127251-A -
Aspergillus oryzae
extracellular/secretion proteins"
/citation=[14]
/codon-start=1
/protein-id="CAK47186.1"
/db-xref="GI:134084153"

/translation="MLSLLISAAAATLASALELP 

QGYSPDPVSCPTNLSWIRPAVGLS 

RDEAQWVEGRKNVILGSLDAYLKRLNLDDFDTDE 

YISRLNNTSQTPIMGMAISGGGFG 

SAYTGTGLIRALDDRLPAANEQRTGGLLQSMTYL 

SGLSGGSWPAVSFPSYNFPTADEI 

VDYWKPEIDRFFTVTNTSAEAATGKAIFEQIATK

YLAGFEVALSDYLGRGFAYEFIPG 

QSGGLNTTFSGIRNLSNFINHQMPMPIIHLASVE 

PEDAEYYDLLVPSSNGTIFDLTPF 

EFGAWDGDVHAFTPTEWLGNQLSNGIPVNQSKCW 

KGFDRSSLVIGTSADAFNFWYLES

VSNGTLGQFAKRSTTHESSLTKRLSQPANLNALV 

DAFQETFDLNLTQISYSKFPNPFT 

NLSLSTGNTHKSSTLNLVDGSETGQTIPLWGQIQ 

PARNVDFIIAWDDSQDADPYSWNN 

GTNLYNTYLAANATGLPFPIIPPSRTMMNLNYTL 

HPQFFGCDANLTTTGDDRAPIVLY 

MANAPYSAYTNFSFWQTETSRQQMGEIFVNSFDI 

VTQANGSWDGEWAECMGCAAVERS 

LARVGMERTRQCQRCFERYCWDGTLDERDPGVLD 

PTLVLDPGVKFGLWNATNPY"

/locus-tag="Anl8g01090" 

/inference="protein motif:SignalP:2.0"

/locus-tag="Anl8g01090"

/product="unnamed"
/locus-tag="Anl8g01090"
/number=1

/locus-tag="Anl8g01090"
/number=1

/locus-tag="Anl8g01090" 
/number=2 

/locus-tag="Anl8g01090"
/number=2

/locus-tag="Anl8g01090" 
/number=3 
complement(<30960..>32218) /locus-tag="Anl8g01100"

complement(join(<30960..31045,31081..31220,
31273..31630,
31702..>32218)) /locus-tag="Anl8g01100"

complement(join(30960..31045,31081..31220,
31273..31630,
31702..32218)) /locus-tag="Anl8g01100"

/EC-number="1.1.1.-"
/inference="profile:COGS:COG0604"
/inference="profile:COGS:COG1064"
/inference="profile:PFAM:PF00107"
/note="unnamed protein product; 
Remark: the patent does not 
provide further information about 
the function of the protein. 
Similarity: the predicted A. niger 
protein shows strong similarity to 
the protein sequence alcohol 
dehydrogenase (sequence 19) of 
patent EP0845532-A/19 from an 
unclassified organism (AC# A92108) 
and to other zinc-containing 
dehydrogenases. Title: strong 
similarity to alcohol 
dehydrogenase from patent 
EP0845532-A - Unclassified 
organism cytoplasm" 
/codon-start=1
/protein-id="CAK47187.1"
/db-xref="GI:134084154"
/translation="MSHAQITHRRAVRRTDDYTP
HTPKVNIVTEPLPLPLSPTAVLIR
VHAVALNYRDANIANGGNPWPVLPHGILGNDAAG
EVIAVGEKVKTLAVGDRAAPITDT
EQITERATKRSWLAADEDGVMADYIVYDESVLCK
LPAYLDWVQASMIPCAGVTAWSAL
KGMRIGQTVLVQGTGGVSMFALKLALAAGLRVVL
TSSSDAKLQAIKEKYAGSAILTVN
YKNTPEWDEEVLKLTGGVGVDLWENGGTGSLVK
SMRCTRRGGWSQVGYLGKQDPSE
LSELVPTLIDRRINLRGINAGSKLDMEDFCEALS
ASQTLLDDLIDKVFPFEQAEEAVE
YVWQEKDIGSERKIQNSFYHLGSYVGTWSSAS"
complement(30960..31045) /locus-tag="Anl8g01100"
/number=1

complement(31046..31080) /locus-tag="Anl8g01100"
/number=1

complement(31081..31220) /locus-tag="Anl8g01100"
/number=2

complement(31221..31272) /locus-tag="Anl8g01100"
/number=2

complement(31273..31630) /locus-tag="Anl8g01100"
/number=3

complement(31631..31701) /locus-tag="Anl8g01100"
/number=3

complement(31702..32218) /locus-tag="Anl8g01100"
/number=4

<32976..>33236
<32976..>33236

/locus-tag="Anl8g01110" 
/locus-tag="Anl8g01110" 



exon 
gene 
mRNA 



32976..33236 /locus-tag="Anl8g01110"
/codon-start=1
/product="hypothetical protein"
/protein-id="CAK47188.1"
/db-xref="GI:134084155"

/translation="MADQPPMGRDFIDAKNVDPV 

DHNDQTDPEDSASQPRFPGPGSST 

TAPSTSKPHSNKLLNKLDPRYDSDVLESEKQENE 

KAQRRGWD" 

32976..33236 /locus-tag="Anl8g01110"
/number=1
complement(<33419..>35477) /locus-tag="Anl8g01120"

complement(join(<33419..33465,33522..34023,
34080..34956,
35015..35044,
35095..>35477)) /locus-tag="Anl8g01120"

complement(join(33419..33465,33522..34023,
34080..34956,
35015..35044,
35095..35477)) /locus-tag="Anl8g01120"



/EC-number="l 

/inference="profile:COGS:COG2303"
/inference="profile:PFAM:PF00732"
/inference="profile:PFAM:PF05199"
/note="unnamed protein product; 
Catalytic activity: versicolorin B 
synthase from A. parasiticus 
catalyses the side chain 
cyclization of racemic versiconal 
hemiacetal to the bisfuran ring 
system of (-)-versicolorin B.
Pathway: versicolorin B synthase 
from A. parasiticus is involved in 
the aflatoxin biosynthetic 
pathway. Remark: a splice site was 
detected upstream of the START 
codon. Remark: aflatoxins comprise 
a group of polyketide-derived 
carcinogenic mycotoxins. Remark: 
the genes encoding the aflatoxin 
biosynthetic enzymes in A. 
parasiticus are clustered. Remark: 
versicolorin B synthase from A. 
parasiticus possesses an 
amino-terminal sequence homologous 
to the ADP-binding region of other 
flavoenzymes, but does not require
flavin or nicotinamide cofactors 
for its cyclase activity. 
Similarity: the predicted A. niger 
protein shows strong similarity to 
versicolorin B synthase from A. 
parasiticus and other 
oxidoreductases. Title: strong
similarity to versicolorin B 
synthase vbs - Aspergillus 
parasiticus" 
/citation=[30] 
/codon-start=1
/protein-id="CAK47189.1"
/db-xref="GI:134084156"

/translation="MIRHVLLSLALQCMVTGTIA 

NPDTLSHNYDERGLFGDSEYDYW 

VGGGTAGNTIATRLAQKNYKVAVIEAGGSYELES 

VAEVPAADVLPVGSDPETAAAHDW 

GFVAENVPGANGRSIHYARGKCWGGSSAMNFMIY 

QRPTIESMEQWATAVNDSSYTFDQ 

TLPFYKNSVKFTPPNTQIRAKNATAGYDPSAYES 

TGGPLKVSYANYAMPFSTWMDLGM 

KAIGINETQDFNLGSLMGGQYCASTIDPNGEVRS 

SSEESFLANKPSTLTTYANTLAKK 

IIFNNQKQATGVQVKGSAGNIYTIKANREVIVSA

GAFQSPQLLMVSGVGPQDQLEEHG 

IQWANRPGVGQNMWDHPFFAPSYRVNVQTFTAI 

ANDFLGIVGQF INMVGFGNGPLTN 

PISDYLAWEKIPAALRSAFSSQTTKQLATFPSDW

PEAEYISGAGYMGNVSNLLINQPE 

DGYQYASMLAVLITPTSRGNITLRSADTDDLPVI 

NPNWLATQSDQEVAIAMFKRVRAA 

FQSKAMAPVIIGNEYNPGLEVQSDEQILQWIKDN 

VMTLWHAACTCKMGTSDDEMAWD

SQARVYGVQGVRWDASAFPFLPPGHPQSTVYML 

AEKIANEIINGA" 

sig-peptide complement(35418..35477) /locus-tag="Anl8g01120"
/inference="protein motif:SignalP:2.0"
mat-peptide complement(join(33422..33465,33522..34023,
34080..34956,
35015..35044,
35095..35417)) /locus-tag="Anl8g01120"
/product="unnamed"
complement(33419..33465) /locus-tag="Anl8g01120"
/number=1

complement(33466..33521) /locus-tag="Anl8g01120"
/number=1

complement(33522..34023) /locus-tag="Anl8g01120"
/number=2

complement(34024..34079) /locus-tag="Anl8g01120"
/number=2

complement(34080..34956) /locus-tag="Anl8g01120"
/number=3

complement(34957..35014) /locus-tag="Anl8g01120"
/number=3

complement(35015..35044) /locus-tag="Anl8g01120"
/number=4

complement(35045..35094) /locus-tag="Anl8g01120"
/number=4

complement(35095..35477) /locus-tag="Anl8g01120"



intron 



exon 



intron 



exon 



intron 



exon 



intron 



gene 



<37406. .>38455 



/number=5 

/locus-tag="Anl8g01130" 



mRNA 
CDS 



<37406..>38455
37406. .38455 



/locus-tag="Anl8g01130" 
/locus-tag="Anl8g01130" 



/inference="profile:COGS:COG3000" 
/note="unnamed protein product; 
Remark: the patent does not 
provide further information about 
the function of the protein. 
Similarity: the predicted A. niger 
protein shows similarity to the 
protein fragment (SEQ ID NO: 
51484) of patent EP1033405-A2 from 
A. thaliana (AC# AAG41385) and to 
putative sterol desaturase family- 
proteins. Title: similarity to 
protein fragment SEQ ID NO: 51484 
from patent EP1033405-A2 - 
Arabidopsis thaliana" 
/codon-start=1
/protein-id="CAK47190.1"
/db-xref="GI:134084157"
/translation="MSGTHPNPKDSMKSTWRRLD
RAEWTIYHWFYEILGVHPEHLDKE
VPVHQKTEKIPYMRTWSQHVWILYHAFIPLAVHH
VYVSYTGQNFTPVGAFFFYSIAFK
LIAIHQLQVMRRMGHVLGFLDGDQHGRDGVPDVG 
VAKWRSLISTSTFRPIMTVFLSY 
RVSQAPAQMSWGWLPLEIGLYGIILDFWFYWYHR 
LMHDVGSLWKYHRTHHLTKHPNPL 
LTLYADTEQEFFDIAGIPLMTYFSMRLMGMPMGF 
YEWWICHQYWFTELAGHSGLRMH 
ASPPSTLDWLLRLIGAELVIEDHDLHHRKGWKKS 
HNYGKQTRLWDKIFGTCHERIESR 
EGNVDYDNTVRMPIF" 



/EC-number="1.1.1.169"
/inference="profile:COGS:COG0240"
/inference="profile:PFAM:PF02558"
/note="unnamed protein product;
Title: similarity to ketopantoate
reductase - Salmonella typhimurium
cytoplasm"
/citation=[23]
/citation=[42]
/codon-start=1
/protein-id="CAK47191.1"
/db-xref="GI:134084158"
/translation="MTSQRIHLIGLGSIGTLIAH 
SLRSLPAPPPITILLHRESAYQDF 
LSGGSKLRLQIGEDGPTQETTGYDAELLQGESPI 
QHLIVAVKASATIPALRSIQHRLG 
RHSVICLFQNGLGQIEELNERLFTDPLTRPTYMF 
GIMRHGVYIKAPFEAVLAGLTGSC 
ALGIVNNDQQASSLSASSTASAQSRFLINRLVEA
PIVRSSELPWMELHQAQLLKLATN 
CWNPLTALLDVRNGSLLANSELQEMRRQLLQEI 
SLVFRRLPEFQGLPDVEEQFSVAR 
LEEQLIGNIEKTASNSSSMREDVRAGRATEIAYI 
NGWIVRRGQQLGLDCPTNRCLTQL 



exon 



37406. .38455 



/locus-tag="Anl8g01130" 
/number=l 



gene 
mRNA 
CDS 



<38805. .>39836 
<38805. .>39836 
38805. .39836 



/locus-tag="Anl8g01140" 
/locus-tag="Anl8g01140" 
/locus-tag="Anl8g01140" 



ILAKSSQGR" 



sig-peptide 



38805. .38876 



/locus-tag="Anl8g01140" 
/inference="protein motif:SignalP:2.0"



mat-peptide



38877. .39833 



/locus-tag="Anl8g01140" 
/product="unnamed"



exon 



38805. .39836 



/locus-tag="Anl8g01140" 
/number=l 



gene 
mRNA 



<40728..>42671
join(<40728..41152,



/locus-tag="Anl8g01150" 
/locus-tag="Anl8g01150" 



41226. .42344, 
42401. .>42671) 



CDS 



join(40728. .41152, 



/locus-tag="Anl8g01150" 



41226. .42344, 
42401. .42671) 



/inference="profile:COGS:COG0477"
/inference="similar to AA
sequence:UniProtKB:AF188621.1"
/note="unnamed protein product;
Function: FLU1 of C. albicans
facilitates resistance to
fluconazole and cycloheximide in
the fluconazole-hypersensitive S.
cerevisiae strain YKKB-13 lacking
the ABC (ATP-binding cassette)
transporter gene PDR5. Function:
FLU1 of C. albicans facilitates
resistance to mycophenolic acid in
C. albicans. Function: FLU1 of C.
albicans is involved in the
resistance to azole derivatives in
C. albicans. Function: MDR1 of C.
albicans facilitates resistance to
the anti-mitotic drug benomyl and
to the dihydrofolate reductase
inhibitor methotrexate.
Similarity: the predicted A. niger
protein shows strong similarity to
fluconazole resistance protein
FLU1 of C. albicans, which is a
permease belonging to the major
facilitator superfamily. Title:
strong similarity to fluconazole
resistance protein FLU1 - Candida
albicans"
/citation=[59]
/codon-start=1
/protein-id="CAK47192.1"
/db-xref="GI:134084159"
/translation="MSPIDEQLEMAEVDARPHTN 
GASKEEGLERPSSSGTTGSTGYSS 
SLEYNPQLHTLHSRNTELDLERHRTATSHALSRI 
ETQRLQHQLTVGESVKSRASRTPL 
PRMGANKDYPPPLPDREDYWEFDGPEDPLYPQN 
WELGTKIYISAILAFTSICSTFDS 
AIFSSSTTNVARVFGVGIEVATLSSSLYICGYAS 
GPLVWAPLSELKGRKPPIWAMLG
FGIFNTAVAVAKDLQTIMICRFFCGVFGSCPLAV 
VAAVFSDIYNNRTRGVAIAMFSST
VFLGPLLAPFIGGFINTSYLGWRWTAYIPAFMGY 
AAFVLNMFFLKESYPPIILVSKAS 
ELRRRTKNWGIHAKQEEIEVDLRELLVNNFSRPL 



exon 40728..41152
intron 41153..41225
exon 41226..42344
intron 42345..42400
exon 42401..42671
gene 42950..43033
tRNA 42950..43033


gene 
mRNA 

CDS 


<43783. .>45994 
join(<43783. .43852, 
43912. .44025, 
44075. .44344, 
44395. .45240, 
45294. .45376, 
45425. .>45994) 
join(43783. .43852, 
43912..44025,
44075. .44344, 
44395. .45240, 
45294. .45376, 
45425. .45994) 



RLLFGEPLILAVTIYLSFIYGLLY 

CFLTAYTLVFQGVYGMSAGVGGLTLFGMWGLFI 

AASYIIFASRGYNKKLEANGGIPV

PEWRLPPVIIGGALFAGGLFWFGWTGFTKSIHWI 

VPTLSGLFTGLGLLIIFIQLFNYL 

IDTYLMFAASAIAANTFCRSLVAASFPLFSRQMF 

NNMGIQWAATLLGCVASILVPIPV

AFYFYGKNLRKRSKFAPFYDKVAESHAEDPVENG 

VAEH" 

/locus-tag="Anl8g01150" 
/number=l 

/locus-tag="Anl8g01150" 
/number=l 

/locus-tag="Anl8g01150" 

/number=2 

/locus-tag="Anl8g01150" 
/number=2 

/locus-tag="Anl8g01150" 
/number=3 

/gene="tRNA-His (GTG)"

/locus-tag="Anl8e01160"

/gene="tRNA-His (GTG)"

/locus-tag="Anl8e01160"

/product="tRNA-His"

/inference="profile:tRNAscan:1.4"

/note="codon recognized: CAC"

/locus-tag="Anl8g01170" 

/locus-tag="Anl8g01170" 



/locus-tag="Anl8g01170" 



/EC-number="2.4.2.30"

/inference="profile:PFAM:PF00644"

/inference="profile:PFAM:PF02877"
/note="unnamed protein product; 
Catalytic activity: the NAP 
protein of Z. mays catalyses the 
reaction NAD(+) +
{ADP-D-ribosyl}(N)-acceptor <=>
nicotinamide +
{ADP-D-ribosyl}(N+1)-acceptor.
Function: the NAP protein of Z. 
mays is involved in programmed 
cell death or apoptosis. 
Localization: the NAP protein of 
Z. mays is a nuclear enzyme. 
Remark: the ADP-D-ribosyl group of 
NAD(+) is transferred to an 
acceptor carboxyl group on a 
histone or the enzyme itself, and 
further ADP-ribosyl groups are 
transferred to the 2'-position of
the terminal adenosine moiety, 
building up a polymer with an 



average chain length of 20-30 
units. Remark: the NAP 
polynucleotide sequences can be 
used for modulation of programmed 
cell death in eukaryotic cells, 
the method is used, specifically 
in plants, to induce, or protect 
against, programmed cell 
death, depending on the extent to 
which PARP activity is reduced, 
reducing expression of endogenous 
NAP class PARP only is also used 
to modulate programmed cell death, 
to increase growth rate and to 
produce plant cells that are more 
tolerant of stress (cold, chemical 
treatments, pathogens, pests,
drought, heat, etc., or during
transformation). Particular
applications are generation of
plants that are resistant to fungi
or nematodes; are male or female
sterile; or have better
seed-shatter properties, the
methods are also used to improve
growth of transformed plant cells
(and derived calli or complete
plants). Similarity: the predicted
A. niger protein shows
strong similarity to the protein
sequence poly(ADP-ribose)
polymerase NAP protein of patent 
WO200004173-A1 from Z. mays (AC# 
AAY68834) and from many other 
eucaryotic organisms. Title: 
strong similarity to 
poly(ADP-ribose) polymerase NAP
protein from patent WO200004173-A1 
- Zea mays nucleus" 
/citation=[61] 
/citation=[63] 
/citation=[64] 
/codon-start=1
/protein-id="CAK47193.1"
/db-xref="GI:134084160"
/translation="MTGQAFKGLVIAVAGSFSGG 
YKQANLKTIIQRHGGTFSSAVTED 
CTHLVTTQREVDNKSVKYTQARKVYTCNIVSLDW 
LVESDSAGKKLDEKKFLMGSDIKK
DDEPESPKKRTLEQALGINEDGTTKKLKDAQTVG 
TKQINVPVDDTCPLRLTFTVYIDP 
TGLIWDATLNQTSATNNNNKFYRIQLLHRNNEFR 
TWTHWGRVGEHGQHALLGGGGLDE 
AEYEFKKKFKDKSGLTWENRLDPPKKGKYTFIEK 
NYEEDTEDEDEDEDKWAKKPTKP 
KAEEVKCTLSAPVQDLVSFIFNKDFFQSTMASMS 
YDAQKLPLGKLSKRTLQNGFQALK 
DLSELIANPALASTKYDTSFTAAVEHLSNLYFTV 
IPHAFGRNRPPVLNNDNLLKREIE 
LLEALTDMEVANSIMKDARNTDTVHPLDRQFQGL 
NMQEMTPLEHTSTEFIELANYLNQ 
SRGHTHGVQYKVINIFRIERQGEKDRFQSSMYSN 



IQNSCRRLLWHGSRSTNFGGILSQ 
GLRIAPPEAPVSGYMFGKGVYFADMSTKSAGYCF 
SWGSGNRGLLLLCDVEVGNPMYER 
DTASFNAGQEAKAEAKIATLGRGRSIPGGWKDAG 
CVNEDLKGVLMPDVRMPTTNSNSR 
GLMYNEYIVYDVAQIQQKYLFHVDMR" 



exon 43783..43852 /locus-tag="Anl8g01170"
/number=1

intron 43853..43911 /locus-tag="Anl8g01170"
/number=1

exon 43912..44025 /locus-tag="Anl8g01170"
/number=2

intron 44026..44074 /locus-tag="Anl8g01170"
/number=2

exon 44075..44344 /locus-tag="Anl8g01170"
/number=3

intron 44345..44394 /locus-tag="Anl8g01170"
/number=3

exon 44395..45240 /locus-tag="Anl8g01170"
/number=4

intron 45241..45293 /locus-tag="Anl8g01170"
/number=4

exon 45294..45376 /locus-tag="Anl8g01170"
/number=5

intron 45377..45424 /locus-tag="Anl8g01170"
/number=5

exon 45425..45994 /locus-tag="Anl8g01170"
/number=6






gene <46879..>48537 /locus-tag="Anl8g01180"

mRNA join(<46879..48454,48518..>48537) /locus-tag="Anl8g01180"

CDS join(46879..48454,48518..48537) /locus-tag="Anl8g01180"









/EC-number="3.5.1.11"
/note="unnamed protein product; 
Function: penicillin V 
amidohydrolase (PVA) from F. 
oxysporum is used for the 
enzymatic hydrolysis of penicillin 
V (phenoxy-methylpenicillin) to 
6-aminopenicillanic acid (6-APA).
6-APA is the active beta-lactam 
nucleus used in the manufacture of 
semi-synthetic penicillins. 
Similarity: the predicted A. niger 
protein shows strong similarity to 
the protein sequence penicillin V 
amidohydrolase (PVA) of patent 
US5516679-A from F. oxysporum (AC# 
AAW00291). Title: strong 
similarity to penicillin V 
amidohydrolase PVA from patent 
US5516679-A - Fusarium oxysporum 
extracellular/ secretion proteins" 
/codon-start=1
/protein-id="CAK47194.1"
/db-xref="GI:134084161"
/translation="MLVYLTNFFIAWLAYFWQPK 
KQPCGSLTLPHIPGVDIISMTRQE 
VHNYSVATSPPYNLADVHNLDFCNVSVTLTHPGE 
DDTVSVSVWLPLNNWNGRFQATGG
GGLAAGIIGLAPAHPVSQGYATAATDGGLTLNGT 



exon 46879..48454
intron 48455..48517
exon 48518..48537
gene
tRNA

complement (4888 
) 

complement (4888 



<49850..>51919
join(<49850..49888,
49981..50173,
50233..50362,
50419..50614,
50702..50828,
50883..51127,
51199..51470,
51538..>51919)
join(49850..49888,
49981..50173,
50233..50362,
50419..50614,
50702..50828,
50883..51127,
51199..51470,
51538..51919)



SNPQTGAWILRPDGSINTALLLNF 

AHRSIYDMTIIGKTLTERFYGSPPRYSYWSGCST 

GGRQGYFAAAKYPNLFDGVLAGAP 

ALNFPRLIGYMFWPPVHMFHSAAPPQCVFDTFWK 

AIIDECDPLDGATDGLISDYNPQS 

CPFKPETLVGHTVTCPEMGSDSPVTITAQHATLV 

KQILQGPDLQDHPDLWTGLPPGAS 

FRGTANTQVINGSIVRPVPFFPIIGWIKNFVYRD 

PDYNVFDMTFDDFNTAYRLTLDGY

NGILGSDDLNLSEFRRAGGKLLTWHGLADELIPA

SWTNAFWEGIDKTDGADVDEFYRV 

FLAPGLGHCSAGHGPKPVDLLGALVRWVEEGIAP 

DRLSAAAVKADGKEVTRELCRYPA 

TLVYDGHGDVNDAGSFSLVFVGA"

/locus-tag="Anl8g01180" 

/number=l 

/locus-tag="Anl8g01180" 
/number=l 

/locus-tag="Anl8g01180"
/number=2 
8960 /gene="tRNA-Ala (AGC)"

/locus-tag="Anl8e01190"
8960 /gene="tRNA-Ala (AGC)"

/locus-tag="Anl8e01190"

/product="tRNA-Ala"

/inference="profile:tRNAscan:1.4"

/note="codon recognized: GCU"

/locus-tag="Anl8g01200" 

/locus-tag="Anl8g01200" 



/locus-tag="Anl8g01200" 



/EC-number="1.14.-.-"
/inference="profile:COGS:COG2124"
/inference="profile:PFAM:PF00067"
/inference="similar to AA
sequence:UniProtKB:AFU81806.1"
/note="unnamed protein product;
Function: ord1 of A. flavus
converts O-methylsterigmatocystin
to aflatoxin B1. Pathway: ord1 of
A. flavus catalyzes the last step 
of the aflatoxin biosynthetic 
pathway. Remark: aflatoxins 
comprise a group of 
polyketide-derived carcinogenic 
mycotoxins. Similarity: the 
predicted A. niger protein shows 



exon 49850..49888
intron 49889..49980
exon 49981..50173
intron 50174..50232
exon 50233..50362
intron 50363..50418
exon 50419..50614
intron 50615..50701
exon 50702..50828
intron 50829..50882
exon 50883..51127
intron 51128..51198
exon 51199..51470
intron 51471..51537
exon 51538..51919



strong similarity to
O-methylsterigmatocystin
(OMST)-oxidoreductase (ord1) from
A. flavus, which belongs to the
CYP64 family of cytochrome
P450-type monooxygenases. Title:
strong similarity to
O-methylsterigmatocystin
oxidoreductase ord1 - Aspergillus
flavus"

/citation=[36]

/codon-start=1

/protein-id="CAK47195.1"

/db-xref="GI:134084162"

/translation="MQHRHNPSSSCPTSQCGRGA 

FIAFTSVYIFFAVLTRKSKRPPLP 

PGPRRKPIVGNLWDLPDPSQQDWQHWLKHKDRYG

PISSLSIMGQTIIVLNDARLAVEL 

LESRSSIHSSRPQQHFAEMAGWNNVLGAVKQSQR 

FRATRKNLHREIGSNVSVARFNEI 

QTAEVGRFLLRVLDAPDKLMKHIRKWVGYGYTIE

PHDQDPLVDLADKAMMDFSMAMLP 

ATWAVDFFLPLKYLPSWFPGTEFMKIAQRYRKNV 

TAFSDIPYAFVKEQMRTGRFVPSF

LSNLLESSDLEPGSEEENTVKWSAGSLYAGGADT 

TVSSIASFFLAMALFPEVQRKAQQ

ELDTVIGTDRLPQYADREQLPYINALVKETFRWH 

PWPMSLTHTSTADDVCEGYFIPK 

GSSVLANIWAFTHDPAAYHDPMTFKPERFLSPKP 

ERDPHFLVFGFGRRVCPGRTLADV 

NVYLTVAQALAVFEISKPVKNGKVKDVQPEFLPG

VISHPAPFDVSIRPRSAKHLELLR 

SLEQKYPWEKSNAEDLKNI"

/locus-tag="Anl8g01200" 

/number=l 

/locus-tag="Anl8g01200" 
/number=l 

/locus-tag="Anl8g01200" 
/number=2 

/locus-tag="Anl8g01200" 
/number=2 

/locus-tag="Anl8g01200"
/number=3

/locus-tag="Anl8g01200"
/number=3

/locus-tag="Anl8g01200" 
/number=4 

/locus-tag="Anl8g01200" 
/number=4 

/locus-tag="Anl8g01200" 
/number=5 

/locus-tag="Anl8g01200"
/number=5

/locus-tag="Anl8g01200" 
/number=6 

/locus-tag="Anl8g01200" 
/number=6 
/locus-tag="Anl8g01200"
/number=7
/locus-tag="Anl8g01200"
/number=7
/locus-tag="Anl8g01200"
/number=8

gene complement(<52148..>53720) /locus-tag="Anl8g01210"

mRNA complement(join(<52148..52359,52411..52573,
52635..52939,
52984..53309,
53368..>53720)) /locus-tag="Anl8g01210"

CDS complement(join(52148..52359,52411..52573,
52635..52939,
52984..53309,
53368..53720)) /locus-tag="Anl8g01210"

/inference="profile:COGS:COG0515"
/inference="profile:PFAM:PF00069"
/note="unnamed protein product; 
Function: SRPK2 from M. musculus 
phosphorylates SF2/ASF, a member 
of SR splicing factors. Remark: 
overexpression of murine SRPK2 
causes disassembly of 
cotransfected SF2/ASF and 
endogenous SC35. SRPK family 
members may regulate the 
disassembly of the SR proteins in 
a tissue-specific manner. 
Similarity: the predicted A. niger 
protein shows similarity to SRPK2 
from M. musculus and strong 
similarity to putative 
serine/threonine protein kinases 
from several eucaryotic organisms. 
Title: similarity to 
SR-protein-specific kinase SRPK2 -
Mus musculus nucleus"
/citation=[39]
/codon-start=1
/protein-id="CAK47196.1"
/db-xref="GI:134084163"
/translation="MAGRITQCARRLGLLHAKLH 
LSCRHSMPTPTPHLRNITRASYQT 
IRYSSQWKSSLTVFPTTDALIDPSIEIEEETLPT 
YHPEKYYPVQQGEVLDNRYQVLAK 
LGYGVTSTVWLGRDLRDSKYWLKIYVTGQEKNH
ELEIYNRMNAVEVEHPGRDLVRRL 
FDHFTVTGPHGPHVCLVHEPMGMSADTLLQKYIP 
GNTMTLDEMKTCIRQLLIALDFLH 
SAARIVHTGKDLQLKNLLLPVPNTKTLETLEERE 
VNDPSPRKILKDRTIYLSTVYNPG 
GSGLPLISDFGEARFGDVEKRDDIMPNMYRAPEV 
VLKENWNYKVDIWNVAMVAWDIVI 
PRHMFDGRNADGIFDDRVHIAEMIALMGPPPASF 
RERCRLAYVFWDEQGNWKDLAPIP 
DISLESLGADIPGENREGFFRWLRKALQWNAEDR 
PTATEFLFDEWLMEGLRRPNNEDAKT"
exon complement(52148..52359) /locus-tag="Anl8g01210"
/number=1

intron complement(52360..52410) /locus-tag="Anl8g01210"
/number=1

exon complement(52411..52573) /locus-tag="Anl8g01210"



gene 
mRNA 



/number=2 

complement(52574..52634) /locus-tag="Anl8g01210"
/number=2

complement(52635..52939) /locus-tag="Anl8g01210"
/number=3

complement(52940..52983) /locus-tag="Anl8g01210"
/number=3

complement(52984..53309) /locus-tag="Anl8g01210"
/number=4

complement(53310..53367) /locus-tag="Anl8g01210"
/number=4

complement(53368..53720) /locus-tag="Anl8g01210"
/number=5

complement(<54105..>55762) /locus-tag="Anl8g01220"

complement(join(<54105..54731,54787..55334,
55392..55500,
55559..>55762)) /locus-tag="Anl8g01220"

complement(join(54105..54731,54787..55334,
55392..55500,
55559..55762)) /locus-tag="Anl8g01220"



/inference="profile:COGS:COG0477"
/inference="similar to AA
sequence:PIR:S64826"
/note="unnamed protein product; 
Function: in S. cerevisiae, 
mutants (dal5) that lack 
allantoate transport have been 
isolated, these strains also 
exhibit a 60% loss of allantoin 
transport capability. Regulation: 
in S. cerevisiae Dal5 appears to 
be sensitive to nitrogen 
catabolite repression, feedback 
inhibition, and trans-inhibition. 
Regulation: in S. cerevisiae 
allantoate uptake is constitutive. 
Similarity: the predicted A. niger 
protein shows strong similarity to 
allantoate permease gene (DAL5) 
from S. cerevisiae, which belongs 
to the major facilitator 
superfamily. Title: strong 
similarity to allantoate permease 
Dal5 - Saccharomyces cerevisiae" 
/citation=[7]
/citation=[9]
/codon-start=1
/protein-id="CAK47197.1"
/db-xref="GI:134084164"
/translation="MEEKLPGAFSQALPVDRTDD 
AVEYLQGHAAADSPQIDLRALRRK 
IDWHLIPFMFSCYVLQFLDKVMLNYAAVMGMKTE 



LHLAGNEYTNTATWFFLAYLIAEA 
PNIYCLQKVPAAKWLGVNVALWGVAAAASAGAKN
YATLLTARVFLGIFEATIGPSLML 
ISSQYYTKSEQAPRFTLWYNGLGVAQIIGGLVSF 
GFQHVHHGATLAGWRIMFLVIGLV 
TVLIGVLTLLFIPDTPMKAKWLSEEEKVALLQHV 
SVNQTGVWSSAINLKQIWEGVLDI 
QLWLLVLITILISVSSGWTTYSATLIAGFGYSG 
PISALLNMPSGIVSIFFTLLVGFG 
IRRTSHRWAWNAFCTIPGIIGGGLLSFLPKSNKA 
GVLIGIYLVNAIVATLPILYQWTM
ANCAGHTKRAFSSALIAGSFSVGNIIGPQTFQAR 
DAPEYRPAKIAVLATQAAAAVMSV 
VLFAYYVWENRRRDRRYGKVEDSMVDEAKWAGLT
DKENTRFRYVY" 
complement(54105..54731) /locus-tag="Anl8g01220"
/number=1

complement(54732..54786) /locus-tag="Anl8
/number=1

complement(54787..55334) /locus-tag="Anl8g01220"
/number=2

complement(55335..55391) /locus-tag="Anl8g01220"
/number=2

complement(55392..55500) /locus-tag="Anl8g01220"
/number=3

complement(55501..55558) /locus-tag="Anl8g01220"
/number=3

complement(55559..55762) /locus-tag="Anl8g01220"



<56384..>58444

join(<56384..56645,
56694..56761,
56818..57009,
57069..57368,
57430..57711,
57897..58158,
58326..>58444)

join(56384..56645,
56694..56761,
56818..57009,
57069..57368,
57430..57711,
57897..58158,
58326..58444)



/number=4 

/locus-tag="Anl8g01230"
/locus-tag="Anl8g01230" 



/locus-tag="Anl8g01230" 



/note="unnamed protein product; 
Function: M. grisea Pth11 is a
pathogenicity gene. Function: M.
grisea Pth11p likely is involved
in host surface recognition.
Function: M. grisea pth11 mutants
of strain 4091-5-8 are
nonpathogenic due to a defect in
appressorium differentiation.
Localization: in M. grisea, a
Pth11-green fluorescent protein



exon 56384..56645
intron 56646..56693
exon 56694..56761
intron 56762..56817
exon 56818..57009
intron 57010..57068
exon 57069..57368
intron 57369..57429
exon 57430..57711
intron 57712..57896
exon 57897..58158
intron 58159..58325
exon 58326..58444

<59070. .>60560 
join(<59070. .59441, 
59521. .59806, 
59879. .60178, 
60237. .60312, 



fusion localised to the cell
membrane and vacuoles. Similarity:
similarity of the predicted A.
niger protein and M. grisea Pth11
is limited to the N-terminal half
of the protein sequences. Title:
similarity to integral membrane
protein PTH11 - Magnaporthe grisea
plasma membrane"
/citation=[52]
/codon-start=1
/protein-id="CAK47198.1"
/db-xref="GI:134084165"

/translation="MGWVHHASPEVEAESQYRLI

LGVCLGLTLLMIITVCLRLAIRFN 

ARRLEASDFVMLVTMVGYLRWIDDPTANRCQIFS 

IIYSALCVAQSRYGLGLPLSLRPK 

ADLPDYTKLNYAGRPFYQLGIAGFKASLCLNYLR 

LISGTSKNFYRILIWAVIAVSTLG 

HLAGTLVLIFDCQPVERAWNPNISGSCLPAGPTF 

YGLAIFTIICDITIIILPIPLLLQ 

LNIKTAQKAGWCLFLLGLFTTICSILRLTQIHR 

VAYGDGNSTMLVLWGTIEFNVGNI

VTCVPFLAPLLKGAVRDFRSYSGRKGYDSRSYAL 

QTWSKDPRSQLRSTTSAAPQPKRT 

PSEELILESGGVDEGGIHMTVELRVSLEKRPTTE 

HDGINSAYASQDQIRPGSYWVDVT

DTSQWSRAEQARPTVPDGIFPAIKPSSIQEESCA 

SRETLMWHPDGQPRAATELCPAH 

ASWPVGLETGTKASSTTPSPTRFQPNILSAVKAL 

LGPALFHLTH" 

/locus-tag="Anl8g01230" 
/number=l 

/locus-tag="Anl8g01230" 
/number=l 

/locus-tag="Anl8g01230"
/number=2 

/locus-tag="Anl8g01230" 
/number=2 

/locus-tag="Anl8g01230" 
/number=3 

/locus-tag="Anl8g01230"
/number=3

/locus-tag="Anl8g01230"
/number=4

/locus-tag="Anl8g01230"
/number=4

/locus-tag="Anl8g01230" 
/number=5 

/locus-tag="Anl8g01230" 
/number=5 

/locus-tag="Anl8g01230"
/number=6

/locus-tag="Anl8g01230"
/number=6

/locus-tag="Anl8g01230"
/number=7

/locus-tag="Anl8g01240"
/locus-tag="Anl8g01240"



sig-peptide 
mat-peptide 



exon 

intron 

intron 
exon 



60374. .>60560) 
join(59070. .59441, 
59521. .59806, 
59879. .60178, 
60237. .60312, 
60374. .60560) 



59070. .59126 



/locus-tag="Anl8 



/inference="profile:COGS:COG0654"
/inference="profile:PFAM:PF01360"
/note="unnamed protein product;
Catalytic activity: NahW of P.
stutzeri catalyzes the conversion
salicylate + NADH + O(2) <=>
catechol + NAD(+) + H(2)O + CO(2).
Pathway: NahW of P. stutzeri is
involved in the metabolization of
naphthalene and salicylates (lower
naphthalene degradation pathway).
Similarity: the predicted A. niger
protein shows similarity to
salicylate hydroxylase (nahW) of
P. stutzeri, which belongs to the
NADH-dependent monooxygenase
superfamily. Title: similarity to
salicylate hydroxylase nahW
- Pseudomonas stutzeri"
/citation=[51]
/codon-start=1
/protein-id="CAK47199.1"
/db-xref="GI:134084166"
/translation="MDDLPVLIVGAGISGLLLAQ 
HLQKLGVPYKIFERDAAIDARSGG 
WGLTLHWALPALRELLPDHLVQRLPEAYVNKAAA 
ARGDTGRFSFFDLKTGSALYSVPA 
AERIRVSRVRLRQLVATGLDVQWNKTLQNIESTA 
DTVTAHFADGTSYTGCLLIGCDGS 
RSPTREILYPDSHEMNPLPVQILGAATLYTAEEM 
AGAAEIDPFIFQGSHPESNVFLFF 
SILDTPNNFVESSKDKYECQIILSWADSKDIAVP 
SDNGERIALMKSLASDWAEPFRTL 
VHRLSEDTEARSIRIADWMFRPLQNRSHPRWLM 
GDSAHTMTMYRGEGANNAIVDVLD 
LTQRVDMRSLGTMSTQALRDALDAYENDVFRRAE 
PSVLNSRQACVDAHDFTRILDESP 
LVSARVLKEDTTEQ" 
/locus-tag="Anl8g01240"
/inference="protein motif:SignalP:2.0"



join(59127..59441,
59521..59806,
59879..60178,
60237..60312,
60374..60557) /locus-tag="Anl8g01240"
/product="unnamed"

59070..59441 /locus-tag="Anl8g01240"
/number=1

59442..59520 /locus-tag="Anl8g01240"
/number=1

59521..59806 /locus-tag="Anl8g01240"
/number=2

59807..59878 /locus-tag="Anl8g01240"
/number=2

59879..60178 /locus-tag="Anl8g01240"



intron 

intron 

exon 

gene 

mRNA 

CDS 



exon 
intron 



60179. .60236 
60237. .60312 

60313. .60373 

60374. .60560 

complement(<60793..>61169)

complement(join(<60793..61039,61120..>61169))
complement(join(60793..61039,61120..61169))

complement(60793..61039)
complement(61040..61119)
complement(61120..61169)
complement(61713..61825)
complement(61713..61825)



/number=3 

/locus-tag="Anl8g01240" 
/number=3 

/locus-tag="Anl8g01240"
/number=4

/locus-tag="Anl8g01240"
/number=4

/locus-tag="Anl8g01240"
/number=5 

/locus-tag="Anl8g01250" 

/locus-tag="Anl8g01250" 

/locus-tag="Anl8g01250" 

/note="unnamed protein product; 
Title: similarity to hypothetical 
protein encoded by Anl6g00840 - 
Aspergillus niger" 
/codon-start=1
/protein-id="CAK47200.1"
/db-xref="GI:134084167"
/translation="MPNPSLSYGNGMSSGPKSRN 
VAFLTMFAVISGALLTFRMQSPSK 
EKRFSPEGDQQAVDGENKLGRSMFVSDGDREAVK 
GGKPGEPKVGRAPHEGLGGR" 
/locus-tag="Anl8g01250" 

/number=l 

/locus-tag="Anl8g01250" 
/number=l 

/locus-tag="Anl8g01250" 



/number=2 

/gene="tRNA-Leu (CAG)"

/locus-tag="Anl8e01260"
/gene="tRNA-Leu (CAG)"



<61921 . .>62425 
join(<61921. .61955, 
62098. .>62425) 
join(61921. .61955, 
62098. .62425) 



61921. .61955 
61956. .62097 



/locus-tag="Anl8e01260" 
/product="tRNA-Leu" 
/inference="profile:tRNAscan:1.4"
/note="codon recognized: CUG"
/locus-tag="Anl8g01270"
/locus-tag="Anl8g01270"

/locus-tag="Anl8g01270"

/codon-start=1

/product="hypothetical protein"
/protein-id="CAK47201.1"
/db-xref="GI:134084168"
/translation="MSDATRCIFITSAEWGSLRC
SQQNPQFNQVDALVERPCNSTVSV
TCSTAALTVDMHESRRSFSRKDQIATLRETIEYR
VTVLAACDRARFPTOPEIFSTNAV
RDDLTRNLGPDSINTERR"
/locus-tag="Anl8g01270"
/number=1

/locus-tag="Anl8g01270"
/number=1



gene 
mRNA 



62098 . .62425 

/locus-tag="Anl8g01270"
/number=2
complement(<64435..>65520) /locus-tag="Anl8g01280"

complement(join(<64435..64754,64947..65072, /locus-tag="Anl8g01280"
65109..65228,
65472..>65520))
complement(join(64435..64754,64947..65072, /locus-tag="Anl8g01280"
65109..65228,
65472..65520))

complement(64435..64754)



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK47202.1"

/db-xref="GI:134084169"

/translation="MTTAAGWGPRLGRPDHLNLL 

PGPRGAANASRKPASESESGSGFH 

PRPWKQNARSRAGRVSPGLSLVPNFFFASCLYLV 

PGVDPRIWPRVRKNSIDNKLFAVL 

FFIATTIRCGVPLREKGMIRVITSYRATVTRRED 

KEASNLSKHGLLWGCLFQKEDQSK 

PVERGSPGGWMTESDTALDPWLRITRAGLHTEDF 

PYAVESWKAR" 

/locus-tag="Anl8g01280" 



/number=1

complement(64755..64946) /locus-tag="Anl8g01280"
/number=1
complement(64947..65072) /locus-tag="Anl8g01280"
/number=2
complement(65073..65108) /locus-tag="Anl8g01280"
/number=2
complement(65109..65228) /locus-tag="Anl8g01280"
/number=3
complement(65229..65471) /locus-tag="Anl8g01280"
/number=3
complement(65472..65520) /locus-tag="Anl8g01280"
/number=4

/locus-tag="Anl8g01290"
/locus-tag="Anl8g01290"

/locus-tag="Anl8g01290"



<66055. .>67388 
join(<66055. .66654 
66723. .>67388) 
join(66055. .66654, 
66723. .67388) 



/note="unnamed protein product; 
Title: strong similarity to 
hypothetical protein encoded by 
Anl3g01340 - Aspergillus niger" 
/codon-start=1
/protein-id="CAK47203.1"
/db-xref="GI:134084170"
/translation="MLSLAIQSLLFEADAHAASP
ALQRRSSSSASTEGEAVMSPLPSL
QTKGLIVISALATVSLICTLSLLTFFTYRFIFWK
RHYKRYIGYNQYWLMFNLILADF
IQSLGFIISLRWIQTNSVTASDPACFLQGIWLQI 



sig-peptide 

mat-peptide

exon 

intron 

exon 

gene 
mRNA 
CDS 



66055. .66105 

join(66106. .66654, 
66723. .67385) 

66055. .66654 

66655. .66722 

66723. .67388 

<68231. .>69040 
<68231. .>69040 
68231. .69040 



gene 
tRNA 



68231. .69040 

69096. .69207 
69096 . .69207 



GDPMSGLFVLAIAAHTFMHVTLGY 

QISHRLFVSIICGLWLFGVITTVIPIAAHGRYVW

YPAVAWCWMTPKYESMRLWTHYFW 

IFASEFFTWLYAIMFIQLRKKIAESAILGEHNS 

ESLTRLKRVIFHMALYPWYICLT 

LPLAAGRMASASGHSPSVLYFCFAGSFMTLCGFC 

DSLMYTLSRRSWLEPEARIHGSS 

NKYSSRPNKSSVAHHYGNSIDGKGPTNTTIARGR 

SDSTEEMIGKDGLELAPMGWLQH 

TTIEVTHEAAYDSASSNLSGQRERSSIYH" 

/locus-tag="Anl8g01290"

/inference="protein

motif:SignalP:2.0"

/locus-tag="Anl8g01290" 

/product="unnamed" 

/locus-tag="Anl8g01290" 

/number=1

/locus-tag="Anl8g01290"
/number=1

/locus-tag="Anl8g01290" 
/number=2 

/locus-tag="Anl8g01300" 
/locus-tag="Anl8g01300" 
/locus-tag="Anl8g01300"
/inference="profile:COGS:COG2808"
/inference="profile:PFAM:PF04299"
/note="unnamed protein product; 
Remark: paiB from B. subtilis 
seems to be essential. Similarity: 
the predicted A. niger protein 
shows weak similarity to 
transcription regulator 2 of the 
pai operon of B. subtilis and 
conserved hypothetical proteins 
from several procaryotic 
organisms. Title: similarity to 
transcription regulator paiB 
- Bacillus subtilis"
/citation=[11]
/codon-start=1
/protein-id="CAK47204.1"
/db-xref="GI:134084171"
/translation="MYMKEIYTDSTPSTLHTFIQ 
QNPLGVLTTAIPSSTHPLLQSTHI 
PWVLDIPPPSTADNSNTVKLRGHIARANPQCSAI 
LDSLTTQPESILPTEVLILFTSPY 
HSYITPHFYTTTKPLTGKVAPTWNYAAVQVYGRA 
RIYNPRSEGELGEQASMFLDTQLR 
DLTAHCEGEIMGYTTTGTSDTYTHNNPEEKSCPR 
AWTVDEAPDAYINILKKNIIGIEV
TVTRMDGKFKMSQERGEGDREGVLKGLDGMGGDV
ARGVAGMVRECCCDTRP"
/locus-tag="Anl8g01300"
/number=1

/gene="tRNA-Leu (CAG)"
/locus-tag="Anl8e01310"
/gene="tRNA-Leu (CAG)"
/locus-tag="Anl8e01310"
/product="tRNA-Leu"
/inference="profile:tRNAscan:1.4"
/note="codon recognized: CUG" 



gene 
mRNA 



<71853. .>73356 
join(<71863. .72033, 



/locus-tag="Anl8g01320"
/locus-tag="Anl8g01320"



72097..72406,
72467. .>73356) 



CDS 



join(71863. .72033, 



/locus-tag="Anl8g01320"



72097. .72406, 
72467. .73356) 



/EC-number="3.4.23.35"
/inference="profile:PFAM:PF00026"
/note="unnamed protein product;
Function: BAR1 of S. cerevisiae
selectively cleaves the
6-Leu-|-Lys-7 bond in the
pheromone alpha-mating factor.
BAR1 activity is abolished by in
vitro mutation of an aspartic acid
predicted to be in the active
site. Induction: secretion of BAR1
of S. cerevisiae is stimulated to
as much as five times the basal
level by exposure of cells to
alpha-factor. Localization: BAR1
of S. cerevisiae is secreted into
the periplasmic space of MATa
cells. Remark: BAR1 of S.
cerevisiae is already active in
early compartments of the
secretory pathway. Bar1 protease
tolerates large N-terminal
extensions of its substrate and
does not require Golgi-specific
modifications such as outer-chain
glycosylation for activity.
Remark: the protein sequence of
barrier protease BAR1 of S.
cerevisiae is covered by patent
WO9118988-A (AC# AAR20109). The
patented barrier protease is
useful in industrial processes
where leucine-lysine or
leucine-arginine cleavage is
required at low pH or high
temperature. Barrier protease may
also be used in the production and
isolation of proteins made by
genetic engineering methods, e.g.
to cleave fusion proteins at
Leu-Arg bonds. Similarity: the
predicted A. niger protein shows
strong similarity to extracellular
protease precursor BAR1 of S.
cerevisiae which belongs to the
subfamily A1 of aspartic-type
peptidases. Title: strong
similarity to extracellular
protease precursor Bar1 -
Saccharomyces cerevisiae
extracellular/secretion proteins"
/citation=[2]
/citation=[3]
/citation=[8]
/citation=[38]



/codon-start=1
/protein-id="CAK47205.1"
/db-xref="GI:134084172"
/translation="MKSTTLLSLAWAAQSAYSLS 
IHERDEPATLQFNFERRQIADRSR 
RKRSTASADLVNLATNLGYTMNLTLGTPGQEVSV 
TLDTGSSDLWVNGANSSVCPCTDY 
GSYNSSASSTYTFVNDEFYIQYVDGSEATGDYVN 
DTLKFSNVTLTNFQFAVAYDGDSE 
EGVLGIGYASNEASQATVGGGEYTNFPEALVDQG 
AINWPAYSLWLDDLDEGKGTILFG 
GVNTAKYYGSLQTLPIVSIEDMYVEFAVNLTAVH 
LEKNGNSVSVNNSATQFPIPAVLD 
SGTALTYIPTSAAASIYEAVGAQYLSEYGYGVIE 
CDVKDEDFTFLFDFGSFNMSVDIS
EMILEASSDMTDMNVCTFGLAVIENEALLGDTFL
RSAYWYDLGNNEISLAKANFNPG
EDHVLEIGTGSDAVPKATGATATGAAATSTASSD
KSDKESSATVPRSQIVSLVAGVLVGVFLVL"
sig-peptide 71863..71916 /locus-tag="Anl8g01320"

/inference="protein
motif:SignalP:2.0"



mat-peptide join(71917..72033, /locus-tag="Anl8g01320"
72097..72406,
72467..73353)
/product="unnamed"




exon 71863..72033 /locus-tag="Anl8g01320"
/number=1
intron 72034..72096 /locus-tag="Anl8g01320"
/number=1
exon 72097..72406 /locus-tag="Anl8g01320"
/number=2
intron 72407..72466 /locus-tag="Anl8g01320"
/number=2
exon 72467..73356 /locus-tag="Anl8g01320"
/number=3






gene <74203..>75729 /locus-tag="Anl8g01330"
mRNA join(<74203..74720, /locus-tag="Anl8g01330"
74803..75129,
75220..75442,
75604..>75729)
CDS join(74203..74720, /locus-tag="Anl8g01330"
74803..75129,
75220..75442,
75604..75729)
/codon-start=1
/product="hypothetical protein"
/protein-id="CAK47206.1"
/db-xref="GI:134084173"
/translation="MPILPRLHLFEIADQPWCPD 
KAIEYVQLCLTHCWNLRLPPIAKA 
SSADVACDVLAENFPDISSFTFVDLGSGAGGPSS 
TLERLLNARLRAQHLPPAQFLLTD 
LNPHPREWAALTKQQENISYISESVDATKCDRLV 
PQIRKECRMFNVAFHHFDDPLAMP 
MLRSAIESADAFIIFELTARDFSSMLILPGLVLI 
AFQYTLLRFWRSPLHLLFTFVLPL 
APLLMVFDGFVSIMRCRTPDELHDLVRRSDAAGL 
ENWEFRSGKSPVMYPTTNVHWFMG 
VKKTAHTHTELVICVSTIKMTRWLSLMRTIAKVL 
QPSSSKVHSFKCGRQPGHPIGSLV 
SVGISGPGPRLCFYANIDHRAKMSTWTTGHSSYK 



exon 74203..74720 /locus-tag="Anl8g01330"
/number=1
intron 74721..74802 /locus-tag="Anl8g01330"
/number=1
exon 74803..75129 /locus-tag="Anl8g01330"
/number=2
intron 75130..75219 /locus-tag="Anl8g01330"
/number=2
exon 75220..75442 /locus-tag="Anl8g01330"
/number=3
intron 75443..75603 /locus-tag="Anl8g01330"
/number=3
exon 75604..75729 /locus-tag="Anl8g01330"
/number=4


gene <76028..>77800 /locus-tag="Anl8g01340"
mRNA join(<76028..76190, /locus-tag="Anl8g01340"
76240..76393,
76459..>77800)
CDS join(76028..76190, /locus-tag="Anl8g01340"
76240..76393,
76459..77800)
/inference="profile:PFAM:PF00328"
/note="unnamed protein product; 
Remark: the patent does not 
provide further information about 
the function of the protein, 
except for a claim of phytase 
activity. Similarity: the 
predicted A. niger protein shows 
strong similarity to protein 
sequence 11 from patent 
EP0684313-A/11 and several 
putative phytases. Title: strong 
similarity to sequence 11 from 
patent EP0684313-A - Unclassified 
organism" 
/codon-start=1
/protein-id="CAK47207.1"
/db-xref="GI:134084174"
/translation="MHPTIATLSILAVASLAEAV
AIEARSQNATPLTPSPSATTTGID
WFQTTPESYQGTTATGVAPFLAETNPAPFGQKTL
SPNDPLETSEPIEGAAGRNIFHYM
GNLSPYHVPDGFGVDEYPLPKGSNITQMHMIHRH
GSRYPSSSEGLASWAQKIINSTAS
GNNFTGALSFLNDWDYGLGLEILVPKGRQELYDS 
GVLNFYNYGHLYNASSPHKLVART 
TTQDRMLKSAENFLAGFFGLEWTEKANLLPIIEG 
VGYNNSLIGTYSCTRALEYMAYNA 
TTPLSTWKNIYLKARTEALRTLTGSYNWTTTDSF 
NAQDMCAYETISYGYSQFCELFTF 
EEFENFGYAFDIEFANMVGFACPAGRAQGIAWVE 
EFLARVEGHLLQTTGTNANMTLDT 
NPVTFPTDQNLYLDFSHDAGIVAVLTAFGFRQFA 
ESLPATGPPLYQQFKSSKIVPFAG 
RTNIEIIKAPNQVAAIRPAGDQSDAYVDGTGETF 
YVHFLQNQRTLPLHSSFEECEYRD 
DGWCELSTFMAVQKKGLERSQFEYACFGNWTVTE 
YGTVTDGVPV" 

sig-peptide 76028..76084 /locus-tag="Anl8g01340"

/inference="protein



mat-peptide

intron 
exon 
intron 
exon 

gene 
mRNA 



join(76085. .76190, 
76240. .76393, 
76459..77797)

76028. .76190 

76191. .76239 

76240. .76393 

76394. .76458 

76459. .77800 

complement(<77988..>79819)

complement(join(<77988..78215,78264..78441,
78499..79055,
79113..79172,
79229..79241,
79298..79508,
79560..79700,
79753..>79819))
complement(join(77988..78215,78264..78441,
78499..79055,
79113..79172,
79229..79241,
79298..79508,
79560..79700,
79753..79819))

motif:SignalP:2.0"
/locus-tag="Anl8g01340"

/product="unnamed"
/locus-tag="Anl8g01340"
/number=1

/locus-tag="Anl8g01340"
/number=1

/locus-tag="Anl8g01340" 
/number=2 

/locus-tag="Anl8g01340" 
/number=2 

/locus-tag="Anl8g01340" 

/number=3 

/locus-tag="Anl8g01350" 
/locus-tag="Anl8g01350" 



/locus-tag="Anl8g01350" 



/inference="profile:COGS:COG2141"
/note="unnamed protein product; 
Remark: HA protein regulates 
homeostasis and adaptation, the C. 
glutamicum HA genes (I) can be 
used in vectors for expression in 
host cells and production of fine 
chemicals, such as, an organic 
acid, proteinogenic or 
nonproteinogenic amino acid 
(preferred), purine or pyrimidine
base, nucleoside, nucleotide,
lipid, saturated or unsaturated
fatty acid, diol, carbohydrate,
aromatic compound, vitamin,
cofactor, polyketide or enzyme,
the amino acids produced can be 
lysine, glutamine, 
glutamate, alanine, aspartate, 
glycine, serine, threonine, 
methionine, cysteine, valine, 
leucine, isoleucine, arginine, 
proline, histidine, tyrosine, 
phenylalanine, or tryptophan, the 
fine chemical production can be 
modulated, the presence of (I) or 
HA proteins encoded by then are 
used for diagnosing the presence 
or activity of Corynebacterium 



diphtheriae. (I) can be used to
map the C. glutamicum genome or
can be used as markers for
genetically engineered
Corynebacterium or Brevibacterium.
the HA proteins encoded by the (I)
are used to maintain homeostasis
in C. glutamicum or help the
microorganism to adapt to
different environmental
conditions. Similarity: the
predicted A. niger protein shows
strong similarity to HA protein
sequence SEQ ID NO: 420 from patent
WO200100842-A2 (AC# AAB79232),
which is a monooxygenase by
similarity. Title: strong
similarity to HA protein sequence
SEQ ID NO: 420 from patent
WO200100842-A2 - Corynebacterium
glutamicum"
/codon-start=1
/protein-id="CAK47208.1"
/db-xref="GI:134084175"
/translation="MAIDNATSRPRKQLILNAFV
EMCSGHQSPGLWRHPEDESHRFND
IGHWIELAKLLESAKFHGIFIADVLGGYDVYKGP
RNLEPAIISGAQWPVNEPLAWPA
MAAATQNIGFGVTVTTTYEQPYHLARRLSTIDHL
TKGRIGWNIVTGYLDSAARNLGHT
QQPQHDDRYAQAEEYIKVTYKLWESSWRSDAWL
DRTRGIYTDPTRVREINHSGKYFT
VPGPHICQPSPQRTPVILQAGTSKAGKAFAAQHA
EAIFVAGHSPSWAKNIAEIRETA
KTQFGRDPAGIKFLALLCPVLGRTEEEAQEKFRY
FRSLGSIDGALALFGGWTGIDLDR
YGDDEELRHVESNAIRSAVEGWSKATPEVEKWTK
ATVGQHITVGGLGATPVGTPEQVA
DEMERWVNEADVDGFNLAYAIKPGSFKDIIDLLI
PELRKRGLFWDDYAVKKGTYRENL
YSKEGQSGPPADHPAAKYRWHAGVEAADHPVPNY

complement(77988..78215) /locus-tag="Anl8g01350"
/number=1
complement(78216..78263) /locus-tag="Anl8g01350"
/number=1
complement(78264..78441) /locus-tag="Anl8g01350"
/number=2
complement(78442..78498) /locus-tag="Anl8g01350"
/number=2
complement(78499..79055) /locus-tag="Anl8g01350"
/number=3
complement(79056..79112) /locus-tag="Anl8g01350"
/number=3
complement(79113..79172) /locus-tag="Anl8g01350"



intron 



/number=4

complement(79173..79228) /locus-tag="Anl8g01350"
/number=4

complement(79229..79241) /locus-tag="Anl8g01350"
/number=5

complement(79242..79297) /locus-tag="Anl8g01350"
/number=5

complement(79298..79508) /locus-tag="Anl8g01350"
/number=6

complement(79509..79559) /locus-tag="Anl8g01350"
/number=6

complement(79560..79700) /locus-tag="Anl8g01350"
/number=7

complement(79701..79752) /locus-tag="Anl8g01350"
/number=7

complement(79753..79819) /locus-tag="Anl8g01350"



<80602..>83154
join(<80602..80707,
80740..80815,
80860..80917,
80965..81151,
81201..81300,
81343..81426,
81473..81659,
81704..81754,
81802..81972,
82019..82337,
82386..82882,
82945..>83154)
join(80602..80707,
80740..80815,
80860..80917,
80965..81151,
81201..81300,
81343..81426,
81473..81659,
81704..81754,
81802..81972,
82019..82337,
82386..82882,
82945..83154)



/number=8 

/locus-tag="Anl8g01360" 
/locus-tag="Anl8g01360" 



/locus-tag="Anl8 



/inference="profile:COGS:COG0591"
/inference="similar to AA
sequence:UniProtKB:SCDURIII.1"
/note="unnamed protein product;
Function: S. cerevisiae DUR3 is
urea symporter mediating urea
import. Induction: high-level
expression of S. cerevisiae DUR3
is inducer dependent, requiring
functional DAL81 and DAL82 genes.
Regulation: expression of S.
cerevisiae DUR3 is regulated in a



exon 

intron 

exon 

intron 

exon 

intron 

intron 
exon 



80602. .80707 
80708. .80739 
80740. .80815 
80816. .80859 
80860. .80917 
80918. .80964 
80965. .81151 
81152. .81200 
81201. .81300 



manner similar to that of other 

genes in the allantoin pathway. 

Repression: DUR3 expression is 

highly sensitive to nitrogen 

catabolite repression and also has 

a partial requirement for the GLN3 

product. Repression: maintenance 

of S. cerevisiae DUR3 mRNA at 

uninduced, nonrepressed basal 

levels requires the negatively 

acting DAL80 gene product. 

Similarity: S. cerevisiae DUR3 

belongs to the major facilitator 

family. Title: strong similarity 

to urea transport protein Dur3 - 

Saccharomyces cerevisiae plasma 

membrane" 

/citation=[17] 

/codon-start=l 

/protein-id="CAK47209 .1" 

/db-xref="GI : 134084176" 

/translation="MSEIQARAAGASVQPPLSQA 

VGYVIVWLGLIIAGADKAVMMVI

TKILKKTTGEDNKKTEMFMTANRTVRTGLTASAV

ISSWLWTTAMLGASFVGYDYGVAG

PFWFAAGCSPMIVFFALIGISYKRKIPDAHTSLE 

WRIRYGRIAHAVFMTLCLINNIF 

ACANMLLGAAAVISAITGMHIIAATFLLPVGVTV 

YTFVGGIKATFLTDYFHTAIILII 

ACYLSVKAFTFEEVGSIGKLYELVQAAAQRHPVS 

GNQDGTYLTMTSKGAILFGILHIC 

SNFGLVIMDTSYFIKAFSAAPSSWPGYTIGGIA 

YFAIPWALGTIMSSLALGLENTAS 

FPTYPRRMTSTEVSNGLVLPYAAMTIAGKGGAAA 

VLLITFMAVTSTLSAQVIAVSSIL 

SFDVYREYINRAASDRDIIRASHFGVIFFAAFSA 

GFSTMLHYVGIDLGWTLYMLGWT 

CPGIFPMAFTILWRRQSRAAAILSPILGMATGIG 

VWLGTAQHFYGAVSVSSTGQILPC 

VYGTVASAFSPIVYSVLITLVKPQRYDWAEFRKE 

KLGLERLDSDSDITVNGQGSEEQQ 

NRTSFDPQELKRWGRIAAFWSIATFLGHWVLWPL 

PMYGSKYVFGKGFFTAWVIVGIIW 

LWITMLVAIFYPLLDGGMQQMLQISRALRGRREA 

VTSTSLPSVNNDSPEVFRVNEKS"

/locus-tag="Anl8g01360"

/number=l 

/locus-tag="Anl8g01360" 
/number=l 

/locus-tag="Anl8g01360" 
/number=2 

/locus-tag="Anl8g01360" 

/number=2

/locus-tag="Anl8g01360"
/number=3

/locus-tag="Anl8g01360"
/number=3 

/locus-tag="Anl8g01360" 
/number=4 

/locus-tag="Anl8g01360" 
/number=4 

/locus-tag="Anl8g01360" 





81301..81342
81343..81426
81427..81472
81473..81659
81660..81703
81704..81754
81755..81801
81802..81972
81973..82018
82019..82337
82338..82385


exon 82386..82882
intron 82883..82944
exon 82945..83154
gene <83419..>85018
mRNA join(<83419..83518,
83784..83957,
84093..84185,
84538..84599,
84694..84802,
84864..>85018)
CDS join(83419..83518,
83784..83957,
84093..84185,
84538..84599,
84694..84802,
84864..85018)
exon 83419..83518
intron 83519..83783
exon 83784..83957
intron 83958..84092



/number=5 

/locus-tag="Anl8g01360" 
/number=5 

/locus-tag="Anl8g01360" 
/number=6 

/locus-tag="Anl8g01360"
/number=6 

/locus-tag="Anl8g01360" 
/number=7 

/locus-tag="Anl8g01360" 
/number=7 

/locus-tag="Anl8g01360" 
/number=8 

/locus-tag="Anl8g01360" 

/number=8 

/locus-tag="Anl8g01360"
/number=9 

/locus-tag="Anl8g01360" 
/number=9 

/locus-tag="Anl8g01360" 
/number=10 

/locus-tag="Anl8g01360" 
/number=10 

/locus-tag="Anl8g01360" 
/number=11

/locus-tag="Anl8g01360"
/number=11

/locus-tag="Anl8g01360" 
/number=12 

/locus-tag="Anl8g01370" 
/locus-tag="Anl8g01370" 



/locus-tag="Anl8g01370" 



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK47210.1"

/db-xref="GI:134084177"

/translation="MYADDPLSLSALPRQVYSLI 

ETFMVLLCGVCKTATLPLELLHII 

GDRLRQKGVPLAPCTTVSRWWKDVFEAYIYIRW 

VRSTKAELEAPLPGLDVLWPTAA 

GTEIYVTKDQPAILTMREARGRRWNSPLVPTLGE

EKVLSVGVDDNRPGAWKLLSQSLS

QWFSSDQGPKYNRRFSDGTGVHDNTHILTNNNRI

ASLSLLDRMITMKLATKGKTRDPR

TNSISLTEAPYQ"

/locus-tag="Anl8g01370"

/number=1

/locus-tag="Anl8g01370"
/number=1

/locus-tag="Anl8g01370" 
/number=2 

/locus-tag="Anl8g01370" 









/number=2 






exon 84093..84185 /locus-tag="Anl8g01370"
/number=3
intron 84186..84537 /locus-tag="Anl8g01370"
/number=3
exon 84538..84599 /locus-tag="Anl8g01370"
/number=4
intron 84600..84693 /locus-tag="Anl8g01370"
/number=4
exon 84694..84802 /locus-tag="Anl8g01370"
/number=5
intron 84803..84863 /locus-tag="Anl8g01370"
/number=5
exon 84864..85018 /locus-tag="Anl8g01370"
/number=6


gene complement(<85096..>90863) /locus-tag="Anl8g01380"
mRNA complement(join(<85096..85140, /locus-tag="Anl8g01380"
85234..85322,
85463..86109,
86186..86269,
86334..86527,
86604..87025,
87079..87143,
87317..87548,
87617..87964,
88032..88194,
88244..88541,
88598..89091,
89221..89335,
89500..90057,
90157..90274,
90374..90575,
90645..>90863))

CDS complement(join(85096..85140, /locus-tag="Anl8g01380"
85234..85322,
85463..86109,
86186..86269,
86334..86527,
86604..87025,
87079..87143,
87317..87548,
87617..87964,
88032..88194,
88244..88541,
88598..89091,
89221..89335,
89500..90057,
90157..90274,
90374..90575,
90645..90863))









/inference="profile:COGS:COG1132"
/inference="profile:PFAM:PF00005"
/exception="reasons given in
citation"

/note="unnamed protein product;
Function: YCF1 from S. cerevisiae
is required for cadmium
resistance. S. cerevisiae cells
harboring a deletion of the YCF1
gene are hypersensitive to cadmium
compared with wild type cells.
Remark: mutagenesis experiments
demonstrate that conserved amino
acid residues, functionally-
critical in the human cystic
fibrosis transmembrane conductance
regulator (hCFTR), play a vital
role in YCF1-mediated cadmium
resistance. Remark: substitution
of a serine to alanine residue in
a potential protein kinase A
phosphorylation site in a central
region of YCF1, which displays
sequence similarity to the central
regulatory domain of hCFTR, also
rendered YCF1 nonfunctional.
Remark: the YCF1 gene of S.
cerevisiae is an MgATP-energized,
uncoupler-insensitive vacuolar
membrane glutathione S-conjugate
transporter. Similarity: the
predicted A. niger protein shows
strong similarity to cadmium
factor (YCF1) from S. cerevisiae,
which belongs to the ATP binding
cassette (ABC) protein superfamily
of membrane transporters. Title:
strong similarity to cadmium
resistance protein Ycf1 -
Saccharomyces cerevisiae [putative
sequencing error] putative
sequencing error"
/citation=[25]
/citation=[27]
/citation=[29]
/citation=[33]
/codon-start=1
/protein-id="CAK47211.1"
/db-xref="GI:134084178"
/translation="MTMIATCEQIDESWGPWAQS 
CRGGFDFTLTFEDTILIILPSIIF 
IVASLIGVLCGRERRQLLHATPYLSTCKPSLHCV 
TIGLADIWENSNSSPHQYRSGGY 
ILRNSVSLSVWVSIAPHRGRTQSVPQSLPFSNN 
HLSSGPDEAPSLAYRGTGNAFEGA 
AKRALWSGPIPLASAYTVGCDLFGMYDAATAGHY 
NTRFGQYSRLSQVQGSKATSLLRA 
LFLTLRIELLIPALPRGVMIAVTLVQPLLLQRIL 
DFVQGEGYSERMSVGYGLIGACAL 
LYGLTSMFNAWYAHASNRLALQIRNVLVDAIYSK 
LLRLPIAKADPGLITTLINVDMEH 
IIEGARVIHDLWAAVISVGVSLYMIYWKLGLAPV 
NHDPWSLQGSKDVGTCRPIYEGI 
EETKDRRSSPGTLIWSILSTVPASASSQIALVAA 
YGGFAIVSRTRDEVMTTDTMFTSL 
ALLQISTDPLFMLIQETPLLVSAYKCIQRIQSFL 
EEHSSMQAKGTLSLQETVDAHGKD 
IFELNCKPLPDASVSLDGEECGTLIKFHNACYSG 
GNTEGEVLLSDLTLVIRQGSAVIV 
AGSIGSGKSSLLKAILGELCFVSGYSYVRPHLRM 
AYCAQEPWLLNDTIRNNILGGRAM 
DSGWYQQVLEAVHLLPDLDSLSERDSTIIGHGGS 
RLSGGQRQRISLARALYSRPQLLL 



ADDILSGLDKKTEQGIIGAVFSPFGLLKRLECTV
VLATHSDLCISKFDDVITMDQGRI
LFRGSSKKTSNIEEMIGNRNLGGNDNFENLKTAS
TPTESITVASVDSVARSLPDEDES
AATAPSDWSIYLFFARSMSVTGFLVFVALAMAIG
AERSFESVWLQDWAESKHPDHVAY
YVGIFTGLWGGLILLYITCWSVLITLRSILVTR
LQPGFPNSVFFEFLMTSSSIGIYG
DQFIQDIMLLDNDLPMAWLNTFTALCAALGETII
VLLSSKYLVISVPFLVGGLXVLQR
FYLRTSKQLRLMDIESKAPLSGFVNDTISGLLTV
RAFGRIDEFQCQARQLLQTSQAAN
YMLLSIQVWLKMILDFIVMILAVAVTGLAVGLRS
QRSVGFLGLALVNLISLSSTIRYV
MTFWVDLETSIGAAARIRQFNISMTPEDDPSNRP
EHKAGFANGNVAFQDVSAVNGTSG
DMILKDVTFSVREGQKVAICGRTGSGKSALLGTL
LRLLDIQRGWTVGDEPITGFSRD
SIRGSFLSLPQKPLLFANSVRYNMDPWGRHADSQ
DFDTYARESLQQVGIWEVFEQLYH
QSLSDAGAHFPIPKSALDLDIDTDGSLTPGQQQL
FCLARLLCRLRQESSKFPVILLDE
VTSSVDQETERTMRCLLRNSLQKHTVLEVIHRLK
KDAVREDYDLVLVLDQGQIVEFGP
PAALLNTPGVSSSGSSLYIVDNLIDVRCSIALLL
PDWREKRGLAQWMEKTEL"

complement(85096..85140) /locus-tag="Anl8g01380"
/number=1
complement(85141..85233) /locus-tag="Anl8g01380"
/number=1
complement(85234..85322) /locus-tag="Anl8g01380"
/number=2
complement(85323..85462) /locus-tag="Anl8g01380"
/number=2
complement(85463..86109) /locus-tag="Anl8g01380"
/number=3
complement(86110..86185) /locus-tag="Anl8g01380"
/number=3
complement(86186..86269) /locus-tag="Anl8g01380"
/number=4
complement(86270..86333) /locus-tag="Anl8g01380"
/number=4
complement(86334..86527) /locus-tag="Anl8g01380"
/number=5
complement(86528..86603) /locus-tag="Anl8g01380"
/number=5
complement(86604..87025) /locus-tag="Anl8g01380"
/number=6
complement(87026..87078) /locus-tag="Anl8g01380"



complement(87079..87143) /locus-tag="Anl8g01380"
/number=7
complement(87144..87316) /locus-tag="Anl8g01380"
/number=7
complement(87317..87548) /locus-tag="Anl8g01380"
/number=8
complement(87549..87616) /locus-tag="Anl8g01380"
/number=8
complement(87617..87964) /locus-tag="Anl8g01380"
/number=9
complement(87965..88031) /locus-tag="Anl8g01380"
/number=9
complement(88032..88194) /locus-tag="Anl8g01380"
/number=10
complement(88195..88243) /locus-tag="Anl8g01380"
/number=10
complement(88244..88541) /locus-tag="Anl8g01380"
/number=11
complement(88542..88597) /locus-tag="Anl8g01380"
/number=11
complement(88598..89091) /locus-tag="Anl8g01380"
/number=12
complement(89092..89220) /locus-tag="Anl8g01380"
/number=12
complement(89221..89335) /locus-tag="Anl8g01380"
/number=13
complement(89336..89499) /locus-tag="Anl8g01380"
/number=13
complement(89500..90057) /locus-tag="Anl8g01380"
/number=14
complement(90058..90156) /locus-tag="Anl8g01380"
/number=14
complement(90157..90274) /locus-tag="Anl8g01380"
/number=15
complement(90275..90373) /locus-tag="Anl8g01380"
/number=15
complement(90374..90575) /locus-tag="Anl8g01380"
/number=16
complement(90576..90644) /locus-tag="Anl8g01380"
/number=16
complement(90645..90863) /locus-tag="Anl8g01380"



/number=17 

complement(<91484..>93088) /locus-tag="Anl8g01390"

complement(<91484..>93088) /locus-tag="Anl8g01390"

complement(91484..93088) /locus-tag="Anl8g01390"

/inference="profile:PFAM:PF00046"
/note="unnamed protein product; 
Function: the CUP1 gene, and its
copper-dependent transcriptional
activator ACE1 play a key role in
mediating copper resistance in S. 
cerevisiae. Similarity: the 
predicted A. niger protein shows 
similarity to the copper 
homeostasis protein CUP9 from S. 
cerevisiae. the predicted A. niger 
protein shows strong similarity to 
the homeobox domain of many 
homeotic proteins. Title: 
similarity to copper homeostasis 
protein Cup9 - Saccharomyces 
cerevisiae nucleus" 
/citation=[26] 
/codon-start=1
/protein-id="CAK47212.1"
/db-xref="GI:134084179"
/translation="MEHAHQAFPKLCDDILSLPP 
LHDEDAVSYPATVDPIFSLPSELP 
DSFSPLPSQPDATSVSPDSGGPAPPTRPPQRLSV 
EAVRTLKSWLNNHPENPYPTTQEK 
DELAQRTSLTRAQVSNWFINARRRKRSSGYMSAR 
QSPSFLSPMERWRSSPPESEAAAT 
DDILRALADSDISAFSDSLPYTVPQYAWSSNDSS 
GSFVLGDASISSCGRSQSSSSEKS 
VALSSHRPPQRPPTPLHRPSSRRHRRKHIRHANR 
LQWTRRPYQCTFCADTFATKYDWQ 
RHEKALHLPVDQWRCSPEGGLIIDNNGTPVCVFC
QQANADEDHLETAHNYSTCLEKPS
EQRIFTRKDNLRQHLKLTHRVDTFHAAMLTWRES
RGHRILSRCGFCSATFQSWQERVD
HVAEHFKHGADMKQWMGDWGFEPEAQRLVENAIP
PYLIGREAQTPDPWRTSDVFPMPR
EEDKEDDEIPFRWDVPTALDRYFDVHRDLLAFIR 
EQMSNGCRPSDQMIQDRARLFAYE 
SDDPWNQTYADDLCWLEAVKQEAGLV" 

complement(91484..93088) /locus-tag="Anl8g01390"

/number=l 

<93571. .>94963 /locus-tag="Anl8g01400" 
join (<93571 . .93941, /locus-tag="Anl8g01400" 
93988. .94242, 
94300. .94738, 
94796 . .>94963) 

join(93571. .93941, /locus-tag="Anl8g01400" 
93988. .94242, 
94300. .94738, 
94796. .94963) 

/inference="profile:COGS:COG0477"
/inference="profile:COGS:COG2271" 
/note="unnamed protein product; 



Function: the MCT3 transporter 
from Rattus norvegicus is a 
proton-linked monocarboxylate 
transporter, it catalyzes the 
rapid transport across the plasma 
membrane of many monocarboxylates 
such as lactate,
pyruvate, branched-chain oxo acids
derived from leucine, valine and 
isoleucine, and the ketone bodies 
acetoacetate, beta-hydroxybutyrate 
and acetate. Similarity: the 
predicted A. niger protein shows 
strong similarity to the 
monocarboxylate transporter 3 
(MCT3) of R. norvegicus, which 
belongs to the major facilitator 
superfamily. Title: strong 
similarity to monocarboxylate 
transporter 3 MCT3 - Rattus 
norvegicus" 
/citation=[44]
/citation=[45]
/codon-start=1
/protein-id="CAK47213.1"
/db-xref="GI:134084180"
/translation="MEASRLADAEVPEGGAGWIV
IAGCAVITWWFIGTSYCWGVLQAA
LVKDGVSSASTLSFVGSLAPACISFLGILNARVI
RKLGTRTSALLGIFLLGLGEILSG
FAVHEVGGLFVTSGWMGLGTSISFMWSITPAQ
YFKAKRGIANGIVYAAGGLGGAAI
SFILDALLSRVGTAWTFRILGFITMGTGLPAAFL
VKQRIPIPPSAFVEWRLFRDIRFL
LLFAAGAIATFPLLVPPFFLPLYTDSLGLGSAAG 
AGWAAFNFSSALGRLTCGFASDT 
IGGLNTLFVSLLLSALSMLIIWPVSTSIGPLWF 
VIINGMANGGFFSTIPTWGNVFG 
SARVSVAMGMIVTSWAGGYLLGSPIAGYILDASG 
GEDAGIKAYRPAILYAGFMALGAS 
ILAAFIRLKTDTRLLKKV" 



exon 93571..93941 /locus-tag="Anl8g01400"
/number=1
intron 93942..93987 /locus-tag="Anl8g01400"
/number=1
exon 93988..94242 /locus-tag="Anl8g01400"
/number=2
intron 94243..94299 /locus-tag="Anl8g01400"
/number=2
exon 94300..94738 /locus-tag="Anl8g01400"
/number=3
intron 94739..94795 /locus-tag="Anl8g01400"
/number=3
exon 94796..94963 /locus-tag="Anl8g01400"
/number=4


gene <95725..>97215 /locus-tag="Anl8g01410"
mRNA join(<95725..95803, /locus-tag="Anl8g01410"
95882..96591,
96649..>97215)
CDS join(95725..95803, /locus-tag="Anl8g01410"
95882..96591,
96649..97215)



exon 

intron 

exon 

intron 

exon 

gene 

mRNA 

CDS 



95725. .95778 

join(95779. .95803, 
95882. .96591, 
96649. .97212) 

95725. .95803 

95804. .95881 

95882. .96591 

96592. .96648 

96649. .97215 

complement(<97301..>98981)

complement(join(<97301..98523,98588..>98981))
complement(join(97301..98523,98588..98981))



/inference="profile:PFAM:PF03663" 

/inference="similar to AA

sequence:PIR:T52516"

/note="unnamed protein product; 

Title: strong similarity to 

hypothetical protein B2J23.120 - 

Neurospora crassa" 

/codon-start=1

/protein-id="CAK47214.1"

/db-xref="GI:134084181"

/translation="MLLRIWHILYTFIVWQAID 

YDVDDPDSIKAACHSVARQMLTHY 

TGNQPGDNPGNLPDPYYWWEAGAMFTALVDYWYL 

TSDDTWNNITTQGITWQAGPSGSF 

MPANQTRTEGNDDQSFWAFAAMSAAERNFPDPPP 

SSGSPGWLAMAQAVFNTQAARWDK 

STCNGGLRWQIFTFNNGWTYKNTISNGCFFNLAA

RLAKYTGNSTYADWADTVWDWTGE 

VGFMTDTYRFWDGADVSSGCGDWNYIEWTYNTGV 

YLLGAAVMYNLTESPVWKARTEGI 

LNASFVFFQDDVMYERACEPVSTCQVDQRSFKGY 

LARWMAATTQMAPFTYDLVMPKLR 

ASAKAAAETCTGGELQATCGLKWTDRKWDGMDDV 

GIQMAALEVMQSTLISRVDPPVTQ 

DTGGTSQGNPAGGEPGPPPAPVPEGLRLEITKAD

RAGAGMMTVMLSMIVIGSTGWMVYE"

/locus-tag="Anl8g01410"

/inference="protein

motif:SignalP:2.0"

/locus-tag="Anl8g01410" 



/product="unnamed"

/locus-tag="Anl8g01410" 

/number=l 

/locus-tag="Anl8g01410" 
/number=l 

/locus-tag="Anl8g01410" 
/number=2 

/locus-tag="Anl8g01410" 
/number=2 

/locus-tag="Anl8g01410" 
/number=3 

/locus-tag="Anl8g01420" 

/locus-tag="Anl8g01420"
/locus-tag="Anl8g01420"
/codon-start=1

/product="hypothetical protein"

/protein-id="CAK47215.1"

/db-xref="GI:134084182"

/translation="MVRGRPRPSRFTRTSPSCAS 

GTTRMCRNRRPLQARRVMQVYSNL 

TCSYPIPTIMPALSVLPADILLHVISFLDHSGDV 

YSFSLLSPSLFTLIHSNDLVNKTR 

YRCVRINCTPDLKNAYTILLDILHDCNLANMIAL 

LTQATCSVSVRPPPGPALSVRNRR 

DYSTWPCRRQTTLRIGSISHTTSAFSITVIEITT 

VSNPASTQEDAYHGKSTFIGQALT 

VLLIASSPNLTSLAISPPFWEYTSSRFNDPSDNH 



HFAAKIEKYPLAKFLKHANSSHLI 

EKGERKIPYLANVKHFQILTTEASSFHIARSRAC 

TPQDLCLCIELVRTLPSIESLRMQ 

AIQGFGNASSAALDGLLPPSSTNISRLDLRGCDF 

KSDLLCKVICSMKCLKELRYSWQ 

GGCYAGFAEFFDRYSLIEAILRHRATLEVLDLEF 

DDQLSQFVAVERESGVQDDAEDKS 

VLKEFISLRSLSLGVSCLWFLATGMGELGNIVLM 

DHLPLQLEYLRIRGYEKGHTKTLD 

NMLAGMESAREGHPALCTVQGVNEYIPVGK"

complement(97301..98523)
/locus-tag="An18g01420"
/number=1

complement(98524..98587)
/locus-tag="An18g01420"
/number=1

complement(98588..98981)
/locus-tag="An18g01420"
/number=2

complement(<100363..>101496)
/locus-tag="An18g01430"

complement(join(<100363..101141,101214..>101496))
/locus-tag="An18g01430"

complement(join(100363..101141,101214..101496))
/locus-tag="An18g01430"

/inference="profile:COGS:COG1064"
/inference="profile:PFAM:PF00107"
/inference="similar to AA
sequence:PIR:S45605"
/note="unnamed protein product;
Catalytic activity: alcohol + NAD+
= aldehyde or ketone + NADH.
Pathway: alcohol dehydrogenase
ADH-T from B. stearothermophilus
is involved in glycolysis /
gluconeogenesis; fatty acid
metabolism; bile acid
biosynthesis; tyrosine metabolism;
glycerolipid metabolism. Remark:
the protein sequence of alcohol
dehydrogenase ADH-T from B.
stearothermophilus NCA1503 is
covered by patent JP04218378-A
(AC# AAR26874). Similarity: the
predicted A. niger protein shows
strong similarity to thermostable
alcohol dehydrogenase ADH-T from
B. stearothermophilus NCA1503,
which belongs to the zinc alcohol
dehydrogenase (ADH) family. Title:
strong similarity to thermostable
alcohol dehydrogenase adhT -
Bacillus stearothermophilus"
/citation=[15]
/citation=[58]
/codon-start=1
/protein-id="CAK47216.1"
/db-xref="GI:134084183"
/translation="MEYTFKVFCGSSDGKWEKL 
TTRRLGDNDVFIETTHSGLCGTDE 



complement(100363..101141)

complement(101142..101213)



HFLHCDQALGHEGVGWKHVGPSVSSVKVGDRVG 

FGFIRRVCGRCDNCISGCDHHCRE 

KRAYGQHDFDVGSFSHGTVWDADAVYPIPEGYDS 

AHAAPLLCAGASVWACLTNNGIRP

SDRVGVMGIGGLGHLAIKLARALRYNWALSSSE

KKREEALEFGASEFYRFPNTQTPN 

HIKPVKHLLLCGSSDVDYASWVSHLPSKQNAIDH 

ANIYRSLLDLVDTNGTIYHISVTL 

KPTPIPLVPFGQKGIRIQGCFITSRRNLQELLEF 

AARFDIKPTIMTFPLTRNGLEETI 

EKLRAGRIRYRAVLEYQAP"

/locus-tag="An18g01430"

/number=1

/locus-tag="An18g01430"



complement(101214..101496)



<101875..>104542
join(<101875..101958,102203..102219,102259..102360,
102392..102399,102621..103153,103205..>104542)
join(101875..101958,102203..102219,102259..102360,
102392..102399,102621..103153,103205..104542)



/number=1

/locus-tag="An18g01430"
/number=2

/locus-tag="An18g01440"
/locus-tag="An18g01440"

/locus-tag="An18g01440"



/inference="profile:COGS:COG3496"

/note="unnamed protein product; 

Similarity: the predicted A. niger 

protein shows local similarity to 

the hypothetical protein mll8086 

from M. loti. Title: strong 

similarity to hypothetical protein 

CAD70872.1 - Neurospora crassa" 

/codon-start=1

/protein-id="CAK47217.1"

/db-xref="GI:134084184"

/translation="MTAKCMVIKKAEEDHTEFLG 

LHHVGSDKNFDNCVTKEVRGPQIL 

KKFLLRQWIPAQGGREQQPESLILSPVRKNLPSI 

SVDGILLHAALLTVNLILKLGGWP

GLLKFMLRTFAISSVLAPIGLIGFLFLVSFTPRN 

IKSLSKDKSVIGKPLLFPITLDHT 

RLSPIKNNFTFNVLFVGIPVGISCRFGRLLSIDA 

KHTDEEECTERSSLLRLLQTYFSS 

WFSFDSARYLHRGDDTLSLENKLNKFLREQNENP 

AKWPYAYMLSVPRFLWWERSWTW 

WYLYSESKELDAVIMEINNSFDEKRNVLFKVRRT 

RIYTESPEKGFEQLLDCKEEHLDE 

DKRVFSLIPQHGKYAYKATWKKEIFSSPFEKVGE

TVSSTFLDPWPSSWSGNRSLSNT 

TTFDPSGAPRMIARLWCKVPPIDPGKASSFQIFS 

ILLIWTVNVTLATPRILFQAIRLH 

VMNLMRMMEHPDVRPGSEPRRPSKGERKDAPLKM 

LHSLFDLHANLQSRMLERFFREYL 



KHIVASYPGDLEVTYIPCKSVFKTTICLRSTQYS 
AEGQDPRRLRLEVMDPAFYSRIVN 
SPNAGTAMAKETKPGRSPADALSSPVIASDITQS 
LQLLDVTVSENRTYDKACSRISSW 
VLCLRRWLTSSFMDSFVCKALPPRTQEEYVSCLI 
QLWLEKLTWDFLSPHQIYRVIRAA 
MLQWAICWILF" 



exon            101875..101958
/locus-tag="An18g01440"
/number=1

intron          101959..102202
/locus-tag="An18g01440"
/number=1

exon            102203..102219
/locus-tag="An18g01440"
/number=2

intron          102220..102258
/locus-tag="An18g01440"
/number=2

exon            102259..102360
/locus-tag="An18g01440"
/number=3

intron          102361..102391
/locus-tag="An18g01440"
/number=3

exon            102392..102399
/locus-tag="An18g01440"
/number=4

intron          102400..102620
/locus-tag="An18g01440"
/number=4

exon            102621..103153
/locus-tag="An18g01440"
/number=5

intron          103154..103204
/locus-tag="An18g01440"
/number=5

exon            103205..104542
/locus-tag="An18g01440"
/number=6


gene            <106231..>106803
/locus-tag="An18g01450"

mRNA            join(<106231..106565,106647..>106803)
/locus-tag="An18g01450"

CDS             join(106231..106565,106647..106803)
/locus-tag="An18g01450"









/EC-number="1.14.-.-"
/note="unnamed protein product;
Function: ordA of A. parasiticus
is involved in the aflatoxin
biosynthesis and converts
O-methylsterigmatocystin (OMST) to
aflatoxins B1 or G1 and converts
dihydro-O-methylsterigmatocystin
(DHOMST) to aflatoxins B2 or G2.
Remark: aflatoxins comprise a
group of polyketide-derived
carcinogenic mycotoxins. Remark:
it is assumed that the ORF is
N-terminally shorter and has
another start codon 5' to the
predicted one; the ORF is around
400 amino acids shorter than most
of the homologous cytochrome p450
proteins. Remark: ordA of A.
parasiticus is also called
cytochrome p450 64, cyp64 or omst
oxidoreductase. Similarity: the
ORF shows similarity to several
cytochrome P450 related proteins
from different species. Title:
strong similarity to
O-methylsterigmatocystin
oxidoreductase ordA - Aspergillus



exon            106231..106565
intron          106566..106646
exon            106647..106803

<107279..>108340
join(<107279..107327,107380..107740,107795..107907,
107966..107988,108045..108097,108151..>108340)
join(107279..107327,107380..107740,107795..107907,
107966..107988,108045..108097,108151..108340)



parasiticus" 

/citation=[43] 

/codon-start=1

/protein-id="CAK47218.1"

/db-xref="GI:134084185"

/translation="MTFNPERFLTTESYQAEHDP
HNLAFGFGRRICPGRGFADSTIFL
TWRSLQAFRIAKISEDGREIEPIVEYLPGVISH
PKPFAISITPRSKEHESFIRSIEI
EHPWEKGDLVSILSVNVLVWASKLRVFTILTSL
YRLPLIGNCKPPNLLLGRMGLIRFSLH"

/locus-tag="An18g01450"

/number=1

/locus-tag="An18g01450"

/number=1

/locus-tag="An18g01450"
/number=2

/locus-tag="An18g01460"
/locus-tag="An18g01460"



/locus-tag="Anl8 



/EC-number="l 

/inference="profile:COGS:COG1028"
/inference="profile:PFAM:PF00106"
/note="unnamed protein product;
Catalytic activity: cyclohexanol
dehydrogenases convert
cyclohexanol + NAD(+) <=>
cyclohexanone + NADH. Function:
chnA of A. sp. is an alcohol
dehydrogenase proposed to
catalyze the conversion of
cyclohexanol to cyclohexanone (EC
1.1.1.245). Phenotype:
cyclohexanol was detected as the
major intermediate accumulated in
the chnA mutant of A. sp. Remark:
ChnA of A. sp. is encoded in the
gene cluster for cyclohexanol
oxidation. Similarity: the ORF
shows similarity to several
dehydrogenases from different
species and with various
specificities. Title: strong
similarity to cyclohexanol
dehydrogenase chnA - Acinetobacter
sp"

/citation=[57]

/codon-start=1

/protein-id="CAK47219.1"

/db-xref="GI:134084186"

/translation="MAHLLQGTAFITGAGSGIGA 

GVARTFIQNGISKLALVDINLSIL 



QKVSESLKQTFPNVELLTSAVDVTNEVHVDNAVQ 

KAATTFGRVDIGVNSAGVGDPGEP 

TDRLSLKDWQRTVHINQTGVWLSQRALIQQMLTQ

KSRGTRHGRGVIVNVCSSLGITAS 

ATGIPFPAYTSSKHGVMGLTKMDAKYYAPSGIRI 

NAVCPGFVDTPMVSKAVDSGVFKK 

EVESAPVGRIGDVEEITDSILFLASPMSSFIYGA 

GLWDGGYGL" 

sig-peptide     join(107279..107327,107380..107387)
/locus-tag="An18g01460"
/inference="protein
motif:SignalP:2.0"



mat-peptide     join(107388..107740,107795..107907,107966..107988,
108045..108097,108151..108337)
/locus-tag="An18g01460"
/product="unnamed"




exon            107279..107327
/locus-tag="An18g01460"
/number=1

intron          107328..107379
/locus-tag="An18g01460"
/number=1

exon            107380..107740
/locus-tag="An18g01460"
/number=2

intron          107741..107794
/locus-tag="An18g01460"
/number=2

exon            107795..107907
/locus-tag="An18g01460"
/number=3

intron          107908..107965
/locus-tag="An18g01460"
/number=3

exon            107966..107988
/locus-tag="An18g01460"
/number=4

intron          107989..108044
/locus-tag="An18g01460"
/number=4

exon            108045..108097
/locus-tag="An18g01460"
/number=5

intron          108098..108150
/locus-tag="An18g01460"
/number=5

exon            108151..108340
/locus-tag="An18g01460"
/number=6


gene            <108826..>111209
/locus-tag="An18g01470"

mRNA            join(<108826..108944,109020..109023,109089..109305,
109377..109778,109841..109905,110160..110181,
110244..110293,110355..110492,110524..110752,
110866..110960,111135..>111209)
/locus-tag="An18g01470"








CDS             join(108826..108944,109020..109023,109089..109305,
109377..109778,109841..109905,110160..110181,
110244..110293,110355..110492,110524..110752,
110866..110960,111135..111209)
/locus-tag="An18g01470"



exon            108826..108944
intron          108945..109019
exon            109020..109023
intron          109024..109088
exon            109089..109305
intron          109306..109376
exon            109377..109778
intron          109779..109840
exon            109841..109905
intron          109906..110159
exon            110160..110181
.110181 



/EC-number="1.14.-.-"
/inference="profile:COGS:COG0493"
/note="unnamed protein product;
Remark: aflatoxins are
polyketide-derived secondary
metabolites. Remark: moxY of A.
parasiticus is expressed
concurrently with genes involved
in aflatoxin biosynthesis and it
lies on one end of the cluster of
these genes. Therefore moxY of A.
parasiticus is presumably also
involved in aflatoxin
biosynthesis. Similarity: the ORF
shows similarity to monooxygenases
from several species and with
different functions. Title: strong
similarity to monooxygenase moxY
- Aspergillus parasiticus"
/citation=[55]
/codon-start=1
/protein-id="CAK47220.1"
/db-xref="GI:134084187"
/translation="MANEQTCSPSINLNTHENMC 
KGRIQQRRYPATCNMLKKPRCVPH 
TQIELCTVLFAFSNMISASTTDTFTQVIIIGAGM 
SGLAMACQLKKQLCCEDFVIYDRA 
PSFGGTWYFNKCCGVDIPAAFYSFSFALYPQFTC 
FFPKQEEILQYIHGVADEFSVALK 
LVGHTEWEGADWQDSEQCWEVRLREIPSGRKFTR 
RCRILISAVGGLTNPKHVMLQGIE 
RFQGNIVHTALWDQETAVAGKNVIVIGNGASATQ 
FIPAIADDAASINQFIRHVRKCAP 
EEQYWSLLTPEYSIGCKRRVFDNDGYLKCLHRPN 
VDITNDPWAVEEQSITTQSGKRF 
PADLIADVDLDYGRCLRLGSLSRSTMSTGEWMG 
VRDKNIGTVSAAYKHLRRWPWRNF
LTFSTFSDPTLAGAIHQLSIQLKVSWIKQASEE
RFNTKLKSALRKTVFTNMCRSLPA
STISLVRVLTWEEKPSLSLL" 
/locus-tag="An18g01470"
/number=1

/locus-tag="An18g01470"
/number=1

/locus-tag="An18g01470"
/number=2

/locus-tag="An18g01470"
/number=2

/locus-tag="An18g01470"
/number=3

/locus-tag="An18g01470"
/number=3

/locus-tag="An18g01470"
/number=4

/locus-tag="An18g01470"
/number=4

/locus-tag="An18g01470"
/number=5

/locus-tag="An18g01470"
/number=5

/locus-tag="An18g01470"



intron          110182..110243
/locus-tag="An18g01470"
/number=6

exon            110244..110293
/locus-tag="An18g01470"
/number=7

intron          110294..110354
/locus-tag="An18g01470"
/number=7

exon            110355..110492
/locus-tag="An18g01470"
/number=8

intron          110493..110523
/locus-tag="An18g01470"
/number=8

exon            110524..110752
/locus-tag="An18g01470"
/number=9

intron          110753..110865
/locus-tag="An18g01470"
/number=9

exon            110866..110960
/locus-tag="An18g01470"
/number=10

intron          110961..111134
/locus-tag="An18g01470"
/number=10

exon            111135..111209
/locus-tag="An18g01470"
/number=11










/locus-tag="An18g01480"

mRNA            <111907..>113460
/locus-tag="An18g01480"

CDS             111907..113460
/locus-tag="An18g01480"








/inference="profile:COGS:COG2124"
/inference="profile:PFAM:PF00067"
/inference="similar to AA
sequence:UniProtKB:AB013443.1"



/note="unnamed protein product;
Function: eln2 of C. cinereus
encodes a novel type of microsomal
cytochrome P450 enzyme, which is
involved in mushroom
morphogenesis. Phenotype: a
dominant mutation of the
elongationless2 (eln2) gene of the
mushroom C. cinereus affects
pattern formation in the
development of fruit body
primordia, causing dumpy primordia
which culminate in mature fruit
bodies with short stipes.
Similarity: the ORF shows
similarity to several cytochrome
p450 related proteins from
different species, which have
different cellular functions.
Title: strong similarity to
cytochrome p450 related protein
eln2 - Coprinus cinereus"
/citation=[54]
/codon-start=1
/protein-id="CAK47221.1"
/db-xref="GI:134084188"
/translation="MPTILLTLLALLTLRLLYEY 
KRDRNLPPGPRRLPLIGNLHQAPQ 
TLPWRVFDQWSKTYGPIMSAQFGRQTLILITSPT 
IARDLLDKRGSIYADRPDLVMANN
ITKGLHMLIRQYDDWLRLHQRLDAPLLSPRASNT 
YHPIQDLESKQLMFDLLRSNDFDA 
HFERYSGSLMFALAYGFRLLSPKGQELRDMRTIQ 



sig-peptide     111907..111957

mat-peptide     111958..113457

exon            111907..113460

gene

mRNA

CDS



AWAPWKRLAEKLFKLEASVHTRHLEKGLNSEPW 

NWSKEFAASKHAEGMPRLDLAYNL 

GILVDAGFETTWTVMKIFVLAMRSDPRFVAVARK 

ELDEWGEDRMPTFEDQEKLVYIQ 

AWDETLRWRSMAPGGIPHAARKEDTYMGYRIPK 

GATVIPLFWSMCLTDEPWDDPLEF 

RPERWFEATEKEEGRFRNFFGYGRRICTGRHIAR 

NSLFLLMARILWAFDIQAPLGDDG 

KPVPVDDMAFDSAFVSTPEPFEALFVPRSEKTKE 

IVEREWNEMEKDMAVLMGQVRDSQ 

RALGLDVRA" 

/locus-tag="An18g01480"
/inference="protein
motif:SignalP:2.0"
/locus-tag="An18g01480"
/product="unnamed"
/locus-tag="An18g01480"
/number=1

complement(<113565..>114733)
/locus-tag="An18g01490"

complement(join(<113565..114153,114210..>114733))
/locus-tag="An18g01490"

complement(join(113565..114153,114210..114733))
/locus-tag="An18g01490"



/inference="profile:COGS:COG0598"
/inference="profile:PFAM:PF01544"
/inference="similar to AA
sequence:UniProtKB:SC41293.1"
/note="unnamed protein product;
Function: alr2 of S. cerevisiae is
an uptake transporter for
inorganic cations, e.g.
magnesium, whose presence seems to
be important for resistance to the
toxic effect of aluminum. Remark:
alr2 of S. cerevisiae is also
called YFL050C. Similarity: the
predicted ORF is 395 amino acids
shorter at the N-terminus and 57
amino acids shorter at its
C-terminal end than alr2 of S.
cerevisiae (nearly the same is
true for alr1 of S. cerevisiae).
Title: strong similarity to ion
transporter Alr2 - Saccharomyces
cerevisiae plasma membrane"
/citation=[40]
/codon-start=1
/protein-id="CAK47222.1"
/db-xref="GI:134084189"
/translation="MSSDSTPDPRLEKPRVAVAS 
RLNFFTSQLNTSIEASSIEDLCSV 
YRPFELLLETGAHSGLWWLDITAPSEEDIEALAR 
FFNLHPLTTEDIKTRETREKIELF 
GQYYFLSLRPPRRLETDTGVRIVSHNLYAWFRG 
GVLSFSFDPSLHTSHVRQRIKEHS 
SHLLLTSDWICYALIDDIVDGFAPFISRVENGW 
TVEDSVSITRPDDMGLALQRIFKL 
RKEVMNIRQPLHDKIDVIRSFARHCDISDTSSSQ 
VALYLSDICDHWTMIANLEQAEQ 



gene 
mRNA 



gene 
mRNA 



complement(113565..114153)

complement(114154..114209)

complement(114210..114733)

complement(<115549..>116444)

complement(join(<115549..115734,115791..116016,
116080..>116444))
complement(join(115549..115734,115791..116016,
116080..116444))

complement(115549..115734)



MLSRLQSKYLTQVHFDSGRMRNGIASALSKLTVL
ASILVPMQFITGLFGMNVRVPGKT
HDGDNSLTWWFSILGFILGLTVIFAWVAKRIGLL
DR"

/locus-tag="An18g01490"
/number=1

/locus-tag="An18g01490"
/number=1

/locus-tag="An18g01490"
/number=2

/locus-tag="Anl8g01500" 
/locus-tag="Anl8g01500" 



/locus-tag="Anl8 



/note="unnamed protein product; 

Title: strong similarity to 

hypothetical protein encoded by 

Anl5g02540 - Aspergillus niger" 

/codon-start=1

/protein-id="CAK47223.1"

/db-xref="GI:134084190"

/translation="MSDQQLLLERQARRHALAKV 

EEHIKQTPNMHVEPSDLKAASIRH 

FPLSLEGTKDQPWFFRPYEEELPLPVEEAEFLQI 

ELTPDNIMWDTRALELFLFDHFHD 

CRGFAKCEYPQNPPYGVYREIGDVKFGQLLECGK 

FNWYAVSVTDYPGENLPHIKAIVE 

NDAIGDDKLLRGEIMTITDIMKARLRSNTLRPHI 

VAPMLVLSLMGPRHARVLEADFDG

VMLNIRASGLYDFTRKNTDAVQLLTRYWLGGACG 

QTTKKS" 

/locus-tag="Anl8g01500" 



complement(115735..115790)

complement(115791..116016)

complement(116017..116079)

complement(116080..116444)



/number=1

/locus-tag="An18g01500"
/number=1

/locus-tag="An18g01500"
/number=2

/locus-tag="An18g01500"
/number=2

/locus-tag="An18g01500"

/number=3
/locus-tag="An18g01510"



complement(<117672..>118234)

complement(join(<117672..117782,117885..118021,
118123..>118234))
/locus-tag="An18g01510"

complement(join(117672..117782,117885..118021,
118123..118234))
/locus-tag="An18g01510"



gene 
mRNA 
CDS 



/note="unnamed protein product;
Remark: the ORF is questionable
due to its suboptimal intron-exon
structure and short length. Title:
questionable ORF"
/codon-start=1
/protein-id="CAK47224.1"
/db-xref="GI:134084191"
/translation="MREQDNDRFMLPITLPNHCH
SGSDVGLPPFACQSLGWQTNAVLG
PVYGKFAESIAIILLCIFGQWDEDLVTRGNIIIS
GVDGETLTKVLLPVLTMKRTWLKG
YHLLIGGVAVCLCVCFV"

complement(117672..117782)
/locus-tag="An18g01510"
/number=1

complement(117783..117884)
/locus-tag="An18g01510"
/number=1

complement(117885..118021)
/locus-tag="An18g01510"
/number=2

complement(118022..118122)
/locus-tag="An18g01510"
/number=2

complement(118123..118234)
/locus-tag="An18g01510"
/number=3

<119176..>120297
/locus-tag="An18g01520"

<119176..>120297
/locus-tag="An18g01520"

119176..120297
/locus-tag="An18g01520"
/EC-number="l 

/inference="profile:COGS:COG0673"
/inference="profile:PFAM:PF01408"
/inference="similar to AA
sequence:UniProtKB:ENQUTCH.2"
/note="unnamed protein product;
Function: due to the presence of a
putative 'zinc cluster' motif and
its low (16%) but significant
similarity with the DNA-directed
DNA polymerase of hepatitis B
virus, it was assumed that qutH of
A. nidulans is a DNA-binding
protein, which is possibly
involved in the regulation of
genes essential for the
utilisation of protocatechuic
acid. Function: due to the
similarity of the ORF to
oxidoreductases, it is here
assumed that the ORF is encoding a
protein with oxidoreductase
activity. Similarity: the ORF
shows similarity to several
oxidoreductases from different
species. Title: strong similarity
to quinic-acid utilisation gene
qutH - Aspergillus nidulans"
/citation=[10]
/codon-start=1



sig-peptide     119176..119226

mat-peptide     119227..120294

exon            119176..120297

gene            complement(<120373..>121787)

mRNA            complement(join(<120373..120856,120912..121419,
121479..121515,121584..>121787))

complement(join(120373..120856,120912..121419,
121479..121515,121584..121787))



/protein-id="CAK47225.1" 

/db-xref="GI:134084192"

/translation="MLNVIIVGAGLIGPRHAQSV 

ITNPSTNLLALVDPSPSASSVAKT

LNTLYFPSLSALLSSVPNIPHPDAAIICTPNHTH 

VPVALELISHNIHILLEKPISDTI 

TTALPLLQAQQKYPDVKILIGHHRRFNPYITKTK 

EILESGSLGPIIALSGLWTLYKPA 

SYFTGATEWRRDKAKGGVLSINLIHDVDLLHYLF 

GPITRVYAEKTLPQRGGPDGNENH

TAEEGAAITFRFASGWGTFWSDCAPSPWNFEA 

GTGENPIIPKVGGVGGGLYRVLGS 

RGSLSVPDLKRWSYDGVEGEKGWNQRLQVEEFEV 

DEGVPFDLQLEHFVRVLERGETPR 

CDAVEGLRALVWDAVKRAMETEEWKVESVEEI 

LARNE" 

/locus-tag="An18g01520"

/inference="protein
motif:SignalP:2.0"

/locus-tag="An18g01520"

/product="unnamed"

/locus-tag="An18g01520"

/number=1

/locus-tag="Anl8g01530" 
/locus-tag="Anl8g01530" 



/locus-tag="Anl8g01530" 



/note="unnamed protein product;
Function: pth11 of M. grisea is
involved in host surface
recognition for the initiation of
appressorium formation, which is
important in plant infection.
Phenotype: M. grisea pth11 mutants
are nonpathogenic due to a defect
in appressorium differentiation.
Remark: on the same contig another
ORF shows similarity to pth11 of
M. grisea, see 130cg. Similarity:
the ORF is 85 amino acids shorter
at its N-terminus and 155 amino
acids shorter at the C-terminal
end than pth11 of M. grisea.
Title: strong similarity to
integral membrane protein PTH11 -
Magnaporthe grisea"
/citation=[52]
/citation=[60]
/codon-start=1
/protein-id="CAK47226.1"
/db-xref="GI:134084193"
/translation="MAVDPAIVAIFGEPPDDIDL
TDSRVQQDNAWIFILCLAVISW
LRFVARRVLRNSLMADDWVIILALVFICTTSGLS
ISGGFYGAGKHVWAISVTKLVTLF



gene 
mRNA 
CDS 



KVSSTPHSTWRALTSQILYIYVMVYSAVCCAAK
VSILFFYRRVFMASHADLSLRIGI
YLGFFLTLSYPIIICVTMATACKPASYFWNQFGG
ASGTCIDTDTFFLALGIINMLNDI
WLLIPFPQIAKLQMNRRKKVAISGILAVGSFAC
VASIVRIYSVDQFTKRTDVTWTLG
PVFIWSTIEPGLSLVCACLPHLAPLVRLAHIKLS
SSQDSKRSRPTNPSTPWRSRSGNA
NGNGTPSAGRRVHSVLSTKYTFDGQDDEEIGLTN
HVTTSASLRKPHSIESVEREGGGS
VDQSIMVHSSFVQSVSNK"

complement(120373..120856)
/locus-tag="An18g01530"
/number=1

complement(120857..120911)
/locus-tag="An18g01530"
/number=1

complement(120912..121419)
/locus-tag="An18g01530"
/number=2

complement(121420..121478)
/locus-tag="An18g01530"
/number=2

complement(121479..121515)
/locus-tag="An18g01530"
/number=3

complement(121516..121583)
/locus-tag="An18g01530"
/number=3

complement(121584..121787)
/locus-tag="An18g01530"
/number=4

<122486..>123895
/locus-tag="An18g01540"

<122486..>123895
/locus-tag="An18g01540"

122486..123895
/locus-tag="An18g01540"
/EC-number="2.3.1.-"
/inference="similar to AA
sequence:UniProtKB:AB014491.1"
/note="unnamed protein product;
Function: TRI101 of F.
sporotrichioides conferred
significantly increased tolerance to
the mycotoxin DAS (trichothecene
4,15-diacetoxyscirpenol), which is
harmful for vertebrates. Function:
TRI101 of F. sporotrichioides
converts isotrichodermol to
isotrichodermin and is required
for the modification of T-2 toxin,
which inhibits protein synthesis
in eukaryotes. Phenotype: TRI101
mutants of F. sporotrichioides
were altered in their abilities to
synthesize T-2 toxin and
accumulated isotrichodermol and
small amounts of
3,15-didecalonectrin and
3-decalonectrin, trichothecenes.
Title: strong similarity to
trichothecene
3-O-acetyltransferase TRI101 -
Fusarium sporotrichioides"
/citation=[48]
/citation=[53]
/citation=[56]
/codon-start=1
/protein-id="CAK47227.1"
/db-xref="GI:134084194"
/translation="MGSIAPDVPTFTPLELIGPR 
GYMRYVFTFPLGEDYVLEDVSRAL 
KAGYNAAAQRLPILACEAIPDPTARQDSVMKLQM
LGPNAEEPILVKDLRTQDTLPSYA
ELKTQHFPVAAFNADTLCRRDIWPTDLGEHLPIS
LVQANFIRGGLILTWCILHMAGDG
NSFQTWMRVWAEECRRAQGVKITDPWLPPAIFA 
DREKVMRSSGRNPGRIEDHPEYW 
LPFKPPEGLPPKMLAKNHRAQVFYFSPEALAALK 
KEASPKNASEPRDVPWISTNDALS 
ALLWRTVMAVQWPLETLESVNQDQESFFNIALNG 
RLRTDPPVHPNTLGCFLEYIGVSM 
PVHKMLTSAQLANLAVLIRQALHQVSNQHTDDVI 
SLIEQLDDIRKIFPASLVDLPGFN 
CILTSWVDFALYNLDWGTLLGGHIESVRAPHVGV 
INGLQWLPNPREGGMEVLVGVAD 
GCLEKLLNDPLWMKYAVAR" 
122486..123895
/locus-tag="An18g01540"
/number=1

complement(<123984..>127145)
/locus-tag="An18g01550"

complement(join(<123984..124060,124183..125132,
125181..125432,125485..125732,125913..>127145))
/locus-tag="An18g01550"

complement(join(123984..124060,124183..125132,
125181..125432,125485..125732,125913..127145))
/locus-tag="An18g01550"

/note="unnamed protein product; 
Title: similarity to hypothetical 
protein AAM43672.1 - Dictyostelium 
discoideum" 

/codon-start=1

/protein-id="CAK47228.1"

/db-xref="GI:134084195"

/translation="MRPIVADVNDNDGHIELKRG 

LCAVLETIESSGSFLTQDLGLGLA 

IPGLNIAGLGNIRLPISADDAKAITQCCDRSPYG 

KGSETLVDESVRKTWQLDPGQFSL 

QNPLWQQQMDDWDDAVTGLGLTAQSDEVQAELY 

KLLIYEEGAFFLPHQDTEKADGMF 

ATLWCLPSKHEGGTLVASHRGWKIAWSTASSSE 

FSFSWAAWYADVIHEVRPVTSGYR 

VALVYNLIHRPLVGLGISGVQTDKLTHLLESWVS 

DCSGNGQSDHSAWDHHINGDCPPA 

LVYVLEHQYTWELSFDRLKGVDQVRFGELQKAC 

QGLDFDLYLANIEKTRMGGVDGRY 

GNSYWKAHNALEDVLGGNQKFLSWDASGSEVGK

RLPFHGKLLIQEDFFSNRLPDDEK 

YQGFTGNEAAKATRVYRATLFSEQPDNALTKRNL 

VRACQTILKKHSSSSEMRAKVMQI

AIDVDDVILFCQTLVSLEGTMSSSEITQTAKVLA 



complement(123984..124060)

complement(124061..124182)



KHGLEAIRLALERVFQDKPHGPSL 

PERVSCISKLFREYRAICQERGRAPSVEMLNWES 

AIISNYLSCYSNDTEPDGRHLVDV 

LTPLPQEGLFERIIPFLETKIERTACLGKFFVSV 

YVYSRRGKFDQDAVDAMLKNLLPK

FFRGFRVQYKHTSNSGLEFAANCRPTKHTTSIRI 

DPKTWGLIKLGDVMDVDNTALFH 

ALTEYTLDVQARDMPDVFTDFLLPVANGVCENVI 

TSGGPSTNSERRFVKHILNKYITG 

YVLPAPLVPPDWKGRSCILCRCYTCASLDTFIED 

PAVRTRKVHMGGNDMEHFDRYLDD 

FFFTKTWGPVGMQMLRIWKTDAMMLVSQLHAWN 

KRARDAKAQLDQLDKHRSLKEILG 

NTYDSLMNHPNLTLPDDHPAATSTAIQAGHPMSG 

SWPNITLYRQIDDEYMSACTDECTNAST"

/locus-tag="Anl8g01550" 

/number=l 

/locus-tag="Anl8g01550" 



complement(124183..125132)

complement(125133..125180)



/number=l 

/locus-tag="Anl8g01550" 



/number=2 

/locus-tag="Anl8g01550" 



complement(125181..125432)

complement(125433..125484)

complement(125485..125732)

complement(125733..125912)

complement(125913..127145)



/number=2
/locus-tag="An18g01550"

/number=3
/locus-tag="An18g01550"

/number=3
/locus-tag="An18g01550"

/number=4
/locus-tag="An18g01550"

/number=4
/locus-tag="An18g01550"

/number=5

<127862..>129230
/locus-tag="An18g01560"

join(<127862..127973,128149..128238,128310..128432,
128566..128684,128777..128854,128935..129026,
129125..>129230)
/locus-tag="An18g01560"

join(127862..127973,128149..128238,128310..128432,
128566..128684,128777..128854,128935..129026,
129125..129230)
/locus-tag="An18g01560"



/note="unnamed protein product; 
Remark: the ORF is questionable 
due to its suboptimal intron-exon 
structure. Title: questionable 
ORF" 



/codon-start=1

/protein-id="CAK47229.1"

/db-xref="GI:134084196"

/translation="MQHAAGKPAADDMRNKALHA

ISLGQAFHGSSPGRAVSAKERTSI 

VNPAQMTADMVGVFRLFHWDGWGVHYSPNRLEFL 

DWRSVASSGDEGKSEFPDGVWTGQ 

ALESIANMQLRGLPLRVPGDFRQFHPPLLFSLAK 

DFFIPVDVANSITAKPPETIPGQS 

LSAISPFLSCGQMRGTNAVGSPDPLSGKSCASHQ 

RVSASTLAFRTGGSRIPGYRTEEQ 

MATPCHEPWLSRRARRIPNVY" 



exon            127862..127973
/locus-tag="An18g01560"
/number=1

intron          127974..128148
/locus-tag="An18g01560"
/number=1

exon            128149..128238
/locus-tag="An18g01560"
/number=2

intron          128239..128309
/locus-tag="An18g01560"
/number=2

exon            128310..128432
/locus-tag="An18g01560"
/number=3

intron          128433..128565
/locus-tag="An18g01560"
/number=3

exon            128566..128684
/locus-tag="An18g01560"
/number=4

intron          128685..128776
/locus-tag="An18g01560"
/number=4

exon            128777..128854
/locus-tag="An18g01560"
/number=5

intron          128855..128934
/locus-tag="An18g01560"
/number=5

exon            128935..129026
/locus-tag="An18g01560"
/number=6

intron          129027..129124
/locus-tag="An18g01560"
/number=6

exon            129125..129230
/locus-tag="An18g01560"
/number=7


gene            <129407..>130916
/locus-tag="An18g01570"

mRNA            join(<129407..129556,129604..129720,129768..129958,
130008..130130,130176..130256,130321..130729,
130866..>130916)
/locus-tag="An18g01570"

CDS             join(129407..129556,129604..129720,129768..129958,
130008..130130,130176..130256,130321..130729,
130866..130916)
/locus-tag="An18g01570"









/note="unnamed protein product;
Function: pth11 of M. grisea
is involved in host surface
recognition for the initiation of
appressorium formation, which is
important in plant infection.
Phenotype: M. grisea pth11 mutants
are nonpathogenic due to a defect
in appressorium differentiation.
Remark: on the same contig another
ORF shows strong similarity to
pth11 of M. grisea, see 160wg.
Similarity: the ORF is 92 amino
acids shorter at the N-terminal
end and 165 amino acids shorter at its
C-terminus than pth11 of M.
grisea. Title: similarity to
integral membrane protein PTH11 -
Magnaporthe grisea"
/citation=[52]
/citation=[60]
/codon-start=1
/protein-id="CAK47230.1"
/db-xref="GI:134084197"
/translation="MAIYGGAAPSLAGSVIMLTT 
LALLTYGLRIYCRVTRKSWSVEDW 
IMTAALVPFAVLVAGCLGGAFNGIGIRDSRLAEP
QNVKYQAEGQKFFLIFEVGYCSAI
IPIKLSISWMLIRVAEGRKAYLYAQYWIWFVL
MNIIALIFILINCIPVDAAWNTEL
LKQGGHCQPSYVLADVYYACTAVNILTDWVTALM
PVPLLWNVQLNRNTKISIVGLMGL
GIFASMSACVRLKYTVALTSQSNYLYSVTNWIW 
GFTENALGMIVGNVATLRPLFRIL 
RDRKTSSNNKYNSRGYYSSQRTGPANMYSRNYEL 
AEQGKHTNQITTTSMADHTRRPSQ 
MSDGDSQKQILAGGTPPGDTDILKKELNDSLQLQ 
FSKQA" 



sig-peptide     129407..129493
/locus-tag="An18g01570"
/inference="protein
motif:SignalP:2.0"



mat-peptide     join(129494..129556,129604..129720,129768..129958,
130008..130130,130176..130256,130321..130729,
130866..130913)
/locus-tag="An18g01570"
/product="unnamed"


exon            129407..129556
/locus-tag="An18g01570"
/number=1

intron          129557..129603
/locus-tag="An18g01570"
/number=1

exon            129604..129720
/locus-tag="An18g01570"
/number=2

intron          129721..129767
/locus-tag="An18g01570"
/number=2

exon            129768..129958
/locus-tag="An18g01570"
/number=3

intron          129959..130007
/locus-tag="An18g01570"
/number=3

exon            130008..130130
/locus-tag="An18g01570"
/number=4

intron          130131..130175
/locus-tag="An18g01570"
/number=4

exon            130176..130256
/locus-tag="An18g01570"
/number=5

intron          130257..130320
/locus-tag="An18g01570"
/number=5

exon            130321..130729
/locus-tag="An18g01570"
/number=6



intron          130730..130865
exon            130866..130916

gene            complement(<131646..>132375)

mRNA            complement(join(<131646..132295,132363..>132375))

CDS             complement(join(131646..132295,132363..132375))

complement(131646..132295)

complement(132296..132362)

complement(132363..132375)

gene            complement(<133082..>135202)

mRNA            complement(join(<133082..133214,133277..134939,
135000..135045,135113..>135202))



/locus-tag="An18g01570"
/number=6

/locus-tag="An18g01570"
/number=7

/locus-tag="An18g01580"
/locus-tag="An18g01580"

/locus-tag="An18g01580"



/EC-number="4.2.1.1"
/inference="profile:COGS:COG0288"

/inference="profile:PFAM:PF00484"
/inference="similar to AA
sequence:UniProtKB:ABO40135.1"
/note="unnamed protein product; 
Catalytic activity: peal of P. 
purpureum converts H(2)CO(3) <=> 
CO(2) + H(2)O with the usage of
zinc as cofactor. Function:
carbonic anhydrases catalyze the 
reversible hydration of carbon 
dioxide. Remark: an alternate name 
for peal of P. purpureum is 
gtpcal. Similarity: the ORF shows 
similarity to several carbonic 
anhydrases from different species 
and with various cellular 
functions. Title: strong 
similarity to carbonic anhydrase 
peal - Porphyridium purpureum" 
/citation=[32]
/citation=[62]
/codon-start=1
/protein-id="CAK47231.1"
/db-xref="GI:134084198"
/translation="MAVTDLHSQQPEKPQDPYTK 
ALTLNHLWAQKTTLQNPSLFPTLA 
RAQHPQILWIGCSDSRCPETTLLDLKPGDVFVHR 
N I ANWNAADVNCAAWE YAVLHL 
KVKHWVCGHTCCGGVGAVLAAPKGERDGEDGEK 
SVLDAWLSSLKRVRDRYASELEGI 
HGEYERGVRLVELNVLEGVRVLMAMGWREAVEK 
GEVEVHGAVYNVGCGWRELELEV KL" 
/locus-tag="Anl8g01580" 

/number=1

/locus-tag="Anl8g01580"
/number=1

/locus-tag="Anl8g01580"
/number=2

/locus-tag="Anl8g01590"
/locus-tag="Anl8g01590" 



complement(join(133082..133214, /locus-tag="Anl8g01590"
133277..134939,
135000..135045,
135113..135202))

/EC-number="2.3.1.7"
/inference="profile:PFAM:PF00755"
/inference="similar to AA
sequence:PIR:S68958"
/note="unnamed protein product;
Catalytic activity: cat2 of C.
tropicalis catalyses acetyl-CoA +
carnitine <=> CoA +
O-acetylcarnitine. Function: cat2
of C. tropicalis is specific for
the acetylation of short chain
fatty acids and may also be
involved in the transport of
acetyl-CoA into mitochondria.
Similarity: the ORF overlaps with
A. niger EST EMBLEST:BE760390.
Title: strong similarity to
carnitine O-acetyltransferase cat2
- Candida tropicalis inner
membrane"
/codon-start=1
/protein-id="CAK47232.1"
/db-xref="GI:134084199"

/translation="MFAASSRNRLSTLPRTRLSP 

AN S L L ARS AVT P I MAPRRKAS S VP 

EGYKEDLSKGNMLRFEDSLPRLPVPSLEETGRRY 

LKSVHALVSEAEYERTKKAVEAFV 

RPGGEGEPLQERLLARAADPKIKNWLLEWWNHAA 

YLGYRDPVIPYVSYFYSYRDDRAR 

RDPAKRAASVATAALEFKRQVDDGSLEPEYMRGQ 

PMAMSTYQYMFNCCRIPGDSVDYP 

QKFPAQDNEHIVWRKNQFFKVPLTVNGKRLNNS 

ELQRQFERIYEVAQPAPAVGVLTV 

ANRDLWADARKKLLAAHPANEQALRD I E S S GF L V 

CLDNATPVTLEERAHQYWHGDGTN 

RWFDKPLQFIINDNGTAGFMGEHSMMDGSPTHRM 

NDHLNNLIFNHKIDLSEKPVRSDL 

PDPRP I TFHLDGPVLEAI DAANKEHRQQI S SHEL 

RVQAYQG YGKG L I KKFKC S P D AYV 

QMIIQLAYFKMYGKNRPTYESASTRKFQEGRTET 

I RT VS DE S VAF C KA I S DP AVP REE 

AVRLLRSALAAHTKYTAEASDGRGVDRHLFGLKK 

LLREGEKLPALYEDPAFAYSSSWY 

LSTSQLSSEFFNGYGWSQVIDDGFGIAYMINENS 

LNFNIVCKRLGAERMSYYLNEAAS 

DMRDMLMP DL AAQ S EKAKM " 

complement (133082. .1332 /locus-tag="Anl8g01590" 

14) 

/number=l 

complement (133215. . 1332 /locus-tag="Anl8g01590" 
76) 

/number=l 

complement (133277. . 1349 /locus-tag="Anl8g01590" 
39) 

/number=2 

complement (134940 . . 1349 /locus-tag="Anl8g01590" 
99) 

/number=2 



gene 
mRNA 



complement (135000. .1350 
45) 

complement (135046. .1351 
12) 

complement (135113. .1352 
02) 

complement (<135940 . .>13 
7166) 

complement (join (<135940 
. .136119, 
136175. .136476, 
136534. .136864, 
136972 . .>137166) ) 
complement (jo in (135940. 
.136119, 136175. .136476, 
136534. .136864, 
136972. .137166) ) 



complement (135940. .1361 
19) 



complement (136120. .1361 



/locus-tag="Anl8g01590" 
/number=3 

/locus-tag="Anl8g01590" 
/number=3 

/locus-tag="Anl8g01590" 
/number=4 

/locus-tag="Anl8g01600" 
/locus-tag="Anl8g01600" 



/locus-tag="Anl8g01600" 



/inference="profile:COGS:COG0515"
/note="unnamed protein product; 
Function: srpk2 was first found to 
mediate the interaction and 
localization of pre-mRNA splicing 
factors by phosphorylating the SR 
(serine/arginine-rich) family of 
splicing factors and therefore 
probably controls the activity of 
splicing. Phenotype: 
overexpression of srpk2 in M. 
musculus causes disassembly of 
cotransfected SF2/ASF and 
endogenous SC35. Similarity: the 
ORF shows similarity to several 
protein kinases from different 
species and with various 
specificity. Title: strong 
similarity to serine/arginine-rich 
protein-specific kinase srpk2 - 
Mus musculus deleted EC-number 
2.7.1.37" 
/citation=[41] 
/codon-start=1
/protein-id="CAK47233.1"
/db-xref="GI:134084200"
/translation="MASLLKWIRASIHPRRAPLP 
PRRFPTTGFTI IPASTILEEEHFN 
SFTNGHYYPVNIGDVFASKYQMIRSHRYVTLKIY 
TRDESHTEEFKTYQHISQANPSHP 
GYGYVRTALDTFIIPRPGGDHPCLVQQPMWDSFT 
DLLFRNPRHRFTEDLLKAGLMQVF 
LALDYLHTECRWHTDIKGGNILQEIRDNSLMEA 
FTAGEMERP S S RKL ADGVP VYAS R 
RFGLPISFGRAVLSDFGSAVRGDESRDHDAQPAV 
YRSPEVMLQVGWSYLVDIWNVGVM 
IWDLFEGKHMFYGEDPDGKGYSTRAHLAEVIGLL 
GHPPLDLLQRGKRSHEFFTEDGML S" 
/locus-tag="Anl8g01600" 

/number=l 

/locus-tag="Anl8g01600" 



gene 
mRNA 



complement ( 136175 . 
76) 

complement ( 136477 . 
33) 

complement ( 136534 . 
64) 

complement ( 136865 . 
71) 

complement (136972. .1371 
66) 

complement (<138020 . .>14 
0061) 

complement (join (<138020 
. .139287, 
139338. .139723, 
139790. .>140061) ) 
complement (join (138020. 
.139287, 139338. .139723, 
139790. .140061) ) 



/number=l 

/locus-tag="Anl8g01600" 



/number=2 

/locus-tag="Anl8g01600" 



/number=2 

/locus-tag="Anl8g01600" 



/number=3 

/locus-tag="Anl8g01600" 



/number=3 

/locus-tag="Anl8g01600" 



/ number=4 

/locus-tag="Anl8g01610" 



/locus-tag="Anl8g01610" 



/locus-tag="Anl8g01610" 



/inference="similar to AA
sequence:PIR:A40711"
/note="unnamed protein product;
Function: srb4 of S. cerevisiae is
a component of the multisubunit
complex, which binds to RNA
polymerase II and is essential for 
efficient establishment of the 
transcription initiation 
apparatus. Phenotype: the S. 
cerevisiae srb4 the null mutant is 
inviable and srb4 mutants display 
global defects in mRNA synthesis. 
Remark: an alternate name for srb4 
of S. cerevisiae is YER022w. 
Title: strong similarity to RNA 
polymerase II suppressor protein 
Srb4 - Saccharomyces cerevisiae 

/citation=[22] 

/codon-start=1

/protein-id="CAK47234.1"

/db-xref="GI:134084201"

/translation="MSDSFSLPLRPLTEKPDRPD 

TLPIEIAQINARWGSFRDVNEETL 

LAKIEEDKNRDPWEEEDEDEKLAEDVDSTERLET 

LYKRRAEILQFAMQAHMEALFALD 

FVSLLLSKHTPRQAETSMSAFLKQVAPLGSLNAE 

IVEPPPKSEAAVQDVKTVSRGWRA 

QNFNAAANKLLNSATRLEEEVASETKYWDEVLAV 

KEKGWKVCRLPRERTALGVQYGFL 

EATPIFRDRGLAALRRADNGSLILDKGLAPQKTR 

T VRVRVKHRGQ L TGC S KMP DP I P D 

AASVERRILQARDTVYEEELFHELMREARIMGSH 

GVTTRQNLVRLPISEDQEILLDLV 

DADRETPDEDKIESTEHDVLANGLSHSIRILLAY 



gene 
mRNA 



complement (138020. .1392 
87) 

complement (139288. .1393 

37) 

complement (139338. .1397 
23) 

complement (139724. .1397 
89) 

complement (139790. .1400 
61) 

complement (<141613 . .>14 
3279) 

complement (join(<141613 

. . 142684, 

142735. .>143279) ) 

complement (join (141613. 

.142684, 

142735. .143279) ) 



AHRQNLRRRTQPPPPLSQKRRLTP 

EYQLLRPVMAYLQHSSHVRWLESFMNDIYRVLRA 

AGIERDFHAVPFASVSLPKVQAVP 

KVESLVQQFLMPFESTFSGHLITPQSSFRVKIRT 

NPVSPPFGTHYDI S I DLPHHPEVQ 

PPGRVGLQNEAAAALMHFVKLDIVSAIALQGSKT 

TKNTTKKEGAEGTLTWEAAYPHHG 

ELLAFSTAGQSKKLKISLSREELKVETFSLRGLE 

GFGRSGGPKAPSLQSQTWTATQTD 

SKPGLMDFVAEVSREQS " 

/locus-tag="Anl8g01610" 

/number=l 

/locus-tag="Anl8g01610" 
/number=l 

/locus-tag="Anl8g01610" 
/number=2 

/locus-tag="Anl8g01610" 



/number=2 

/locus-tag="Anl8g01610" 
/number=3 

/locus-tag="Anl8g01620"
/locus-tag="Anl8g01620" 

/locus-tag="Anl8g01620" 



/EC-number="5.2.1.5"
/inference="profile:COGS:COG4716" 
/note="unnamed protein product; 
Catalytic activity: linoleate 
isomerases catalyse the 
isomerisation of 

9-cis,12-cis-octadecadienoate <=>
9-cis,11-trans-octadecadienoate.
Function: linoleate isomerase of 
patent WO9932604-A1 is used to 
produce conjugated 

linoleic/linolenic acid (CLA) from 
oils such as sunflower oil, 
safflower oil, corn oil, linseed 
oil, etc. Similarity: the ORF 
shows strong similarity to 
myosin-crossreactive antigen, 
which could induce autoimmune 
reactions in vertebrates. Title: 
strong similarity to linoleate 
isomerase from patent WO9932604-A1 
- Lactobacillus reuteri" 
/codon-start=1
/protein-id="CAK47235.1"
/db-xref="GI:134084202"
/translation="MNRGNGLPKRDPKTTQAWLI 
G S G I S S L AGAVHL I HDAGVP AAN I 
HILDVHSHAGGGIKSCGDAKSGYVLYTGSLPYFH 
DRCVEHLLSLVPSVDDPKQSLLDS 



IKELDRATSPKKSAPTRLIKHGDKGPECLYSKHL 
Q I GPKYRMEL I KLMLENERI LGEK 
AIQDAFDEAFFKTNFWTLWATTFALQPWHSLVEF 
RRCLCKHLAEIERLNDVKALDRTK 
YTIYESVIMPIESYLKSQGVDFHFNAKVTNLQIN 
PKEAQTTVSGIIIKDNGEQKTIEV 
RPEDLVMVTLGSTTSATERGSNDKAPAAPPQHSK 
EFLDDDWALWIDLMQASTDYGNPF 
NFHNNVDQSTLESFTVTLRDSDFMERYEKLTNNK 
PGTGALLSFSDSNWGLSISVPRQP 
VC S DQP S S VDVF WG YGLHPEKTGNFVHKPMCHC S 
GKEILTEVLSQLGMPVDDMLANSI 
TNPVLMPMATAPLMPRRHDYRPEVIPPQSRNLAL 
VGQYVEIQDDTTLSMEYSVRGAQM 
AVFSAMKLNKHPPKIERHLLLSVFDLLGGA" 
complement (141613 . .1426 /locus-tag="Anl8g01620" 



complement ( 142685 , 
34) 

complement ( 142735 , 
79) 

<144032. .>145578 
join (<144032 . .1441 
144230 . .145373, 
145427. .>145578) 
join(144032. .14416 
144230. .145373, 
145427. .145578) 



/number=l 
.1427 /locus-tag="Anl8g01620" 

/number=l 
.1432 /locus-tag="Anl8g01620" 

/number=2 

/locus-tag="Anl8g01630" 
69, /locus-tag="Anl8g01630" 



/locus-tag="Anl8g01630" 



/EC-number="1.5.3.-"
/inference="profile:COGS:COG0277"
/inference="profile:PFAM:PF01565"
/note="unnamed protein product; 
Catalytic activity: HDNO of A. 
oxidans catalyses 
(D)-6-hydroxynicotine + H(2)O +
O(2) =
1-(6-hydroxypyrid-3-yl)-4-(methylamino)butan-1-one
+ H(2)O(2).
Induction: HDNO of A. oxidans 

requires FAD in the enzymatic 
assay for maximal enzyme activity. 
Similarity: the ORF shows 
similarity to several 
FAD-dependent oxygenases from 
different species and with various 
specificity. Title: strong 
similarity to 6-hydroxy-D-nicotine 
oxidase 6-HDNO - Arthrobacter 
oxidans" 
/citation=[4]
/codon-start=1
/protein-id="CAK47236.1"
/db-xref="GI:134084203"
/translation="MPFLSFARALELRRQLEGTR 
AEWCIGSDDYATSIRRWSDTCEK 
EAGAWRVT S T SEVAEWRFCRKNH I DF WEAGG 
HSTTGASSSHGGWISMARMCKVL 
T DP AS E T VC VQGGANWDMVNH S T AP YGL AWG AT 
ASHSGVGGSALGGGFGWLTGQHGL 



lADQLLSVKMVLADGSIVEASDEDNQDLFWAVRG 
AGQAFGVATEFVFRAHKVRDRFFG 
GLVYYDVDKLPMLVSFANEFDKRQDPKSGFFFGF 
AAPREIGHMWLAVLFYDGSAYDG 
EAFFEP I LNPNPL INRAAMKS Y I EMNS I ANVDPV 
PEGRKSIGGANIMPPLETSLLQNL 
YSQFKEAMNTYPRMEDSALVFELLPYTKAVQVPI 
KETACANRGPYYNVGLILCWHDSD 
LDAKMHALQRS I ISKILEAQRDITDDHAWYPNL 
AGHDVSAEKLFGANLPRLQKLKKK 
YDPHNVFRKWHDLLAPARSHVEQTDKP " 
exon 144032..144169 /locus-tag="Anl8g01630"

/number=1

intron 144170..144229 /locus-tag="Anl8g01630"

/number=1

exon 144230..145373 /locus-tag="Anl8g01630"

/number=2

intron 145374..145426 /locus-tag="Anl8g01630"

/number=2

exon 145427..145578 /locus-tag="Anl8g01630"

/number=3

gene <146433 . .>148685 /locus-tag="Anl8g01640" 

mRNA join (<146433. .146489, /locus-tag="Anl8g01640" 

146546. .146952, 

146998. .147273, 

147331 . .147795, 

147845. .147954, 

148006. .>148685) 

CDS join(146433. .146489, /locus-tag="Anl8g01640" 

146546. .146952, 
146998. .147273, 
147331. .147795, 
147845. .147954, 
148006. .148685) 

/EC-number="2.4.1.-"
/inference="profile:COGS:COG0438"
/inference="similar to AA
sequence:PIR:T00130"
/note="unnamed protein product; 
Function: TSase of G. frondosa 
catalyzes the trehalose synthesis 
from alpha-D-glucose 1-phosphate 
and D-glucose. Similarity: the ORF 
shows similarity to a 
clock-controlled gene (cogs) of N. 
crassa with unknown function. 
Title: strong similarity to 
trehalose synthase TSase - Grifola 
frondosa" 
/citation=[47] 
/citation=[49] 
/codon-start=1
/protein-id="CAK47237.1"
/db-xref="GI:134084204"
/translation="MSSARSPFDAESWSGKPLNV 
lYAGITELVCNNSSGRVAIAIRNL 
TDLVDFLVCTWHALRPNVS DYATDT I I AELKDYR 
EKNTEKIISAALHQSLVYRCPSLC 
SRLWSELDIVPLVLEHKDRERRHDDQGELATFAG 
WNKKELDERADSMVRKCIRSFGIG 
HVLHNHINFDGSVDVDRGYHVHLANAKDYEKTVD 
PATWSMAQCFAQDLREREVKVAFF 



exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

gene 

tRNA 



146433. .146489 
146490. .146545 
146546. .146952 
146953. .146997 
146998. .147273 
147274. .147330 
147331. .147795 
147796. .147844 
147845. .147954 
147955. .148005 



complement (149158. .1492 
30) 



complement (149158. .1492 
30) 



<149921. .>150603 
join(<149921. .149936, 
150144. .150502, 
150577. .>150603) 
join(149921. .149936, 
150144. .150502, 
150577. .150603) 



SMTCQGKPDVP TRHAL SRF TKS LGVR I KWF VP KP 

RPGMIPLIRKMQDTLEGQGDPLSD 

ITINDELLILDFAYANARRYWLCENGPLRPRAEG 

GVDWIIDSAPLLTLAILSKQQDP 

ERPVIFESSLQPQGVSLSGTSSPQSRAWDFIRTR 

LTHVDLWSLLPKELAPRIMPEEN 

VGYMSFSVDQLDGQNKPLTDWDVGFYGREFSSLC 

RTLQMSIIRYPEEQYILHLSQFRP 

GDGTLCLLHSYQKFCDAYTKEHPGRQVPKLLICH 

RGPFRTPESTVFYDAAMSQIDSSE 

TLSTSVCIIPI GAVDQMWNAL L TNARAL VQL S T L 

HGVPEMLLAAIQKGTPVIAVREAE 

LFPFVHESENAILVDKGDEEGIARCFSRIFSVDE 

VRQGEGDAGFRKLSDSNTTVGNAV 

CWLYLASKMSRGDKFEPRGADIYKLAREEAGYNE 

CDWHAV" 

/locus-tag="Anl8g01640" 
/number=l 

/locus-tag="Anl8g01640" 
/number=l 

/locus-tag="Anl8g01640" 
/number=2 

/locus-tag="Anl8g01640" 
/number=2 

/locus-tag="Anl8g01640" 
/number=3 
/locus-tag= 
/ number= 
/locus-tag= 
/number=4 
/locus-tag="Anl8g01640" 
/number=4 

/locus-tag="Anl8g01640"
/number=5 

/locus-tag="Anl8g01640" 
/number=5 

/locus-tag="Anl8g01640" 
/number=6 

/gene="tRNA-Lys (CTT)"

/locus-tag="Anl8e01650" 

/gene="tRNA-Lys (CTT)" 

/locus-tag="Anl8e01650" 

/product="tRNA-Lys" 

/inference="profile:tRNAscan:1.4"

/note="codon recognized: AAG" 

/locus-tag="Anl8g01660" 

/locus-tag="Anl8g01660" 



:"Anl8g01640" 
:"Anl8g01640" 



/locus-tag="Anl8g01660"



/note="unnamed protein product;
Remark: the ORF is questionable
due to its suboptimal intron-exon
structure. Similarity: the ORF
overlaps with the 5' region of the
A. niger pfkA gene. Title:
questionable ORF"
/codon-start=1



exon 

intron 

exon 

intron 

exon 

gene 

mRNA 



149921. .149936 
149937. .150143 
150144. .150502 
150503. .150576 

150577. .150603 

<150631. .>153085 

join(<150631. .151371, 
151428. .151889, 
151937. .>153085) 

join(150631. .151371, 
151428. .151889, 
151937. .153085) 



/protein-id="CAK47238.1"

/db-xref="GI:134084205"

/translation="MKETESGDIRCFQDCKRKWA 

EEDIPCQHPSRICPLQRAARSPGR 

SAFDEHLALSPADQLSPQWWAVYSVQSTFPSNER 

GYLYSGAVTAGVSDVLVAHTRFFS 

PAPALLFSPTLQTQVISAFPPQALLLIPNNC" 

/locus-tag="Anl8g01660" 

/number=1

/locus-tag="Anl8g01660"
/number=1

/locus-tag="Anl8g01660" 
/number=2 

/locus-tag="Anl8g01660" 

/number=2 

/locus-tag="Anl8g01660" 

/number=3 

/gene="pfkA" 

/locus-tag="Anl8g01670" 

/gene="pfkA" 



/locus-tag="Anl8g01670" 
/gene="pfkA"



/locus-tag="Anl8g01670"
/EC-number="2.7.1.11"
/inference="profile:COGS:COG0205"
/inference="profile:PFAM:PF00365"
/inference="similar to AA
sequence:SWISSPROT:K6PF_ASPNG"
/note="Catalytic activity: pfkA of
A. niger is catalysing ATP +
D-fructose 6-phosphate <=> ADP +
D-fructose 1,6-bisphosphate.
Function: pfkA of A. niger is
active in a key control step of
glycolysis. Gene-ID: pfkA
Similarity: the ORF overlaps with
the sequence of entry EMBL:ANPFKA
(A. niger pfkA gene) cytoplasm"
/citation=[1]
/citation=[5]
/citation=[21]
/citation=[35]
/codon-start=1
/product="6-phosphofructokinase
pfkA-Aspergillus niger"
/protein-id="CAK47239.1"
/db-xref="GI:134084206"
/db-xref="InterPro:IPR000023"
/db-xref="InterPro:IPR009161"

/translation="MAPPQAPVQPPKRRRIGVLT 

SGGDAPGMNGWRAWRMAI HS DC 

EAFAVYEGYEGLVNGGDMIRQLHWEDVRGWLSRG 

GTLIGSARCMTFRERPGRLRAAKN 

MVLRGIDALWCGGDGSLTGADVFRSEWPGLLKE 

LVETGELTEEQVKPYQILNIVGLV 

GSIDNDMSGTDATIGCYSSLTRICDAVDDVFDTA 

F SHQRGFVI EVMGRHCGWLALMS A 

ISTGADWLFVPEMPPKDGWEDDMCAIITKNRKER 



150631. .151371 



151372. .151427 



151428. .15188 



151890. .151936 



151937. .153085 



gene 
mRNA 



GKRRT I VI VAEGAQDRHLNKI S S S 
KIKDILTERLNLDTRVTVLGHTQRGGAACAYDRW 
L S T L QG VE AVR AVL DMKP E AP S P V 
I T I RENKI LRMPLMDAVQHTKTVTKH I QNKEF AE 
AMALRDSEFKEYHFSYINTSTPDH 
PKLLLPENKRMRIGIIHVGAPAGGMNQATRAAVA 
YCLTRGHTPLAIHNGFPGLCRHYD 
DTPICSVREVAWQESDAWVNEGGSDIGTNRGLPG 
DDLATTAKSFKKFGFDALFWGGF 
EAFTAVSQLRQAREKYPEFKIPMTVLPATISNNV 
PGTEYSLGSDTCLNTLIDFCDAIR 
QSASSSRRRVFVIETQGGKSGYIATTAGLSVGAV 
AVYIPEEGIDIKMLARDIDFLRDN 
FARDKGANRAGKIILRNECASSTYTTQWADMIK 
EEAKGRFESRAAVPGHFQQGGKPS 
PMDRIRALRMATKCMLHLESYAGKSADEIAADEL 
SASVIGIKGSQVLFSPMGGETGLE 
ATETDWARRRPKTEFWLELQDTVNILSGRASVNN 
ATWSCYENA" 
/gene="pfkA"
/locus-tag="Anl8g01670"
/number=1
/gene="pfkA"
/locus-tag="Anl8g01670"
/number=1
/gene="pfkA"
/locus-tag="Anl8g01670"
/number=2
/gene="pfkA"
/locus-tag="Anl8g01670"
/number=2
/gene="pfkA"
/locus-tag="Anl8g01670"
/number=3
.>15 /locus-tag="Anl8g01680" 



complement (<1534 
4b08) 

complement (join (<153410 /locus-tag="Anl8g01680" 

. .153429, 

153509. .>154508) ) 

complement ( join ( 153410 . /locus-tag="Anl8g01680" 
.153429, 

153509. .154508) ) 

/EC-number="1.6.-.-"
/inference="profile:COGS:COG0492"
/inference="profile:PFAM:PF00070"
/inference="similar to AA
sequence:UniProtKB:AF281147.1"
/note="unnamed protein product; 
Catalytic activity: NADH 
dehydrogenases catalyse NADH + 
acceptor <=> NAD(+) + reduced 
acceptor. Function: ndh of P. 
fluorescens is involved in 
colonization of this bacterium. 
Phenotype: in an oxygen-poor 
medium mutant PCL1201 of P. 
fluorescens, which carries a 
mutation in the ndh gene showed a 
decreased growth rate. Remark: the 
sequence of the database entry 
EMBL:ANPFKA (A. niger pfkA gene) 
overlaps with the ORF due a 



gene 
mRNA 



probably wrong number of 4658 bp
for the coding sequence of the A. 
niger protein, which is encoded 
3'. Similarity: the ORF is 112 
amino acids shorter at its 
C-terminal end than ndh of P. 
fluorescens. Title: similarity to 
NADH dehydrogenase II ndh
- Pseudomonas fluorescens
localisation:mitochondrion"
/citation=[46]
/codon-start=1
/protein-id="CAK47240.1"
/db-xref="GI:134084207"
/translation="MTTDKTITIPIIGASFAGLT 
TAHSLLSHFTTNKSKGIKIKILLI 
NPTPNFYWAIAAPRILAKPTAFSESQYLIPIADG 
FTKYSPEVFEFILGRATSLDFENK 
LLQVEESTTNNTNKALREIKYDYLVIASGSTPSA 
SSSSPLFPGETETENKDADRKEVI 
YPFKLSPTSTSTITETIQSAQHTISTSKKITIIG 
AGP I GVE L AGE L ADL T S S AS S KEK 
KDITLISSTPRILPVLKESASGTATSLLTSKGVR 
VLTNTKVI SVS ASKEGGGGYELKF 
ENGETMDTDIYIPTIGVLPNSSYIPGEVLDEKGW 
VRVDSELKWGVDGVYAAGDVTDL LGRGG" 
complement (154443 . . 1545 /locus-tag="Anl8g01680" 



complement (join (153413 
. 153429, 

153509 . . 154442) ) 

complement (153410. .1534 
29) 

complement (153430. .1535 
08) 

complement (153509 
08) 



/inference= "protein 
motif :SignalP: 2.0" 
/locus-tag="Anl8g01680" 



/product=" unnamed" 
/locus-tag="Anl8g01580" 



/ number=l 
/locus-tag="Anl8 

/number =1 
) /locus-tag="Anl8 

/number=2 
I /locus-tag="Anl8 



complement (<154789 . .>15 /locus-tag="Anl8g01690" 

6395) 

complement (join (<154789 /locus-tag="Anl8g01690" 

. . 154869, 

154921. .155060, 

155120. .155914, 

155981. .156128, 

156179. .156193, 

156245. .156318, 

156371. .>156395) ) 

complement ( join (154789 . /locus-tag="Anl8g01690" 

. 154869, 154921 . . 155060, 

155120. .155914, 

155981 . . 156128, 

156179. .156193, 

156245. .156318, 

156371. .156395) ) 

/inference="similar to AA
sequence:PIR:S67043"



complement (154789 
69) 



complement ( 154870 . 
20) 



complement ( 154921 . 
60) 



complement ( 155061 . 
19) 



complement ( 155120 . 
14) 



complement ( 155915 . 
80) 



complement (155981 . 
28) 



complement (156129 
78) 



complement (156179 . 
93) 



complement (156194 
44) 



complement (156245 . 
18) 



complement (156319 . 
70) 



complement (156371 . 
95) 



/note="unnamed protein product; 

Title: strong similarity to 

hypothetical protein YOR155c - 

Saccharomyces cerevisiae" 

/codon-start=1

/protein-id="CAK47241.1"

/db-xref="GI:134084208"

/translation="MTTRYRVEYALKVGSHLCCQ 

SMLTPAVSPSRPIIEWIKGLLAVP 

FVLHSQPTAVYQEHSENLIAVAADTHQRYAEIFR 

DVEMLIRDHITHQQDDAPGKSKLK 

LLVPTVGTFFTPLYLVDAFRRQDAQRFISRRRFV 

APSFNDIRLILNSAQLIGLARTTG 

VDLVTFDGDVTLYDDGACLTDDNPVILRIMRLLL 

QGRKVGIVTAAGYTDAAKYYERLK 

GLLDAMYESAEMTDAQRAGLWMGGESNFLFRYD 

HASPSRLSYVPREEWLLEEMKTWQ 

EGDITRLLDIAESSLRACASNLNLPVAVLRKDRA 

VGVYPTERGRI SREQLEETVLWQ 

NTVERSEVGARLPFCAFNGGNDVFVDIGDKSWGV 

RACQRYFGGIEPARTLHVGDQFLS 

AGANDFKARLASTTAWIASPAETVQLLDELQGI" 

/locus-tag="Anl8g01690" 

/number=l 

/locus-tag="Anl8g01690" 
/number=l 

/locus-tag="Anl8g01690" 
/number=2 

/locus-tag="Anl8g01690"



/number=2

/locus-tag="Anl8g01690"



/number=3

/locus-tag="Anl8g01690" 



/number=3 

/locus-tag="Anl8g01690" 



/number=4 

/locus-tag="Anl8g01690" 



/number=4 

/locus-tag="Anl8g01690" 



/number=5 

/locus-tag="Anl8g01690" 



/number=5 

/locus-tag="Anl8g01690" 



/number=6 

/locus-tag="Anl8g01690" 



/number=6 

/locus-tag="Anl8g01690" 



complement (<156827. .>15 



/number=7 

/locus-tag="Anl8g01700" 



8638) 

mRNA complement ( join (<156827 /locus-tag="Anl8g01700" 

. .157888, 

157938. .158066, 

158115. .158310, 

158364. .>158638) ) 
CDS complement( join(156827. /locus-tag="Anl8g01700" 

.157888, 15793 8.. 158066, 

158115. .158310, 

158364. .158638) ) 

/inference="profile:COGS:COG0477"

/inference="profile:PFAM:PF00083"

/note="unnamed protein product; 

Function: qutD of E. nidulans is 

involved in quinic acid transport 

with similarity to the maltose 

transport protein MAL61. 

Phenotype: mutations in the qutD 

gene of A. nidulans cause the loss 

of ability to grow upon quinic 

acid as sole carbon source in 

media at normal pH 6.5 and

failure to induce three enzyme 

activities specifically required 

for metabolism to protochatechuic 

acid. Similarity: the ORF overlaps 

with A. niger EST 

EMBLEST:BE759752. Title: strong 

similarity to quinate transport 

protein qutD - Aspergillus 

nidulans" 

/citation=[6]

/citation=[10]

/codon-start=1

/protein-id="CAK47242.1"

/db-xref="GI:134084209"

/translation="MAGSKKPVNIFRLRNLADPK 

EVFNWRLWFAVLSFGLMGAARGVD 

EGLISGAFNSKDFQRTIHLDSYSEVEQTNIKANV 

S AMVQ I G S VGG ALF AF L VC DR I GR 

LWATRQLCVLWILGIAIFMGVGANGSLGAVYAGR 

FVAGLGVGQTVWGPVYLAE TAP A 

SVRGLCTCVFTGFVYLGIVLAYFTNYGCQIHMGD 

HTHKRWEVPTSLHIMFAGLIFLLS 

FLQYESPRFLVKKGKPEEALKNLARLRNLPPEHE 

YWEEFTG I QN AHQAEME ATMGAG 

WLGIVKEAIMVPSNLYRLYLAAMAQLLSQWSGAG 

SITLYAPDLFSLLGITGSNESLLV 

TAVFG II KLVAAVI CALFLVDVI GRKRALLLG I T 

LQAISMIYVASFLTAVPEMGIVDD 

FVLPESKKGSSRGAIAMIYISGVGWALGWNSMQY 

LLTAELFPLRIRALATSAAMTLHF 

VNQYGNSRAVPNMLLGTAEGGITPKGTFWFFSAV 

TVLGGLWVWFSVPETAGRSLESMD 

RLFALPWYQ I GRHGNRDADQQDQWADKQQQVED 

GFGGAEHRERV" 

sig-peptide complement ( 158513 .. 1586 /locus-tag="Anl8g01700" 

38) 

/inference= "protein 
motif :SignalP: 2.0" 
mat-peptide complement ( join ( 156830 . /locus-tag="Anl8g01700" 

.157888, 157938. .158066, 



158115. .158 
158364. .158 



310, 
512) ) 



exon 

intron 

exon 



complement (156827. .1578 



complement ( 1 
37) 



complement (157938. .1580 
66) 



complement (158067. .1581 



complement (158115. .1583 
10) 



complement (158311. .1583 
63) 



complement (158364. .1586 
38) 



<158961 
join (<15896 
159213. 
join(158961 
159213. .159 



59525 
1 . . 15918 
9525) 

159184 
525) 



158961. .159184 

159185. .159212 

159213. .159525 

<159975. .>161849 
join(<159975. .160316, 
160367. .160732, 
160782. .161738, 
161796. .>161849) 
join(159975. .160316, 
160367. .160732, 
160782 . . 161738, 
161796. .161849) 



/product="unnamed" 
/locus-tag="Anl8g01700" 

/number=1

/locus-tag="Anl8g01700"
/number=1

/locus-tag="Anl8g01700" 
/number=2 

/locus-tag="Anl8g01700" 
/number=2 

/locus-tag="Anl8g01700" 
/number=3 

/locus-tag="Anl8g01700" 
/number=3 

/locus-tag="Anl8g01700" 
/number=4 

/locus-tag="Anl8g01710" 

/locus-tag="Anl8g01710" 

/locus-tag="Anl8g01710" 
/codon-start=1

/product="hypothetical protein"

/protein-id="CAK47243.1"

/db-xref="GI:134084210"

/translation="MLDSLASSKATKFRRCAGFV 

VGRSVLAAGDNPGIEPLRGIPDPV 

LKVQVGQTDNLGTRLGQLPLRTNGSRPSRCERET 

GDGLSTGRHGIWLPGEVEMGATRR 

EMLGSCGGQPKGGVSLAQGLNNTKRALRKQTWWL 

ACPAGRSNPTSGGYMTFAGHDRSA 

VSQSGVQQQMPRSPAVLG" 

/locus-tag="Anl8g01710" 

/number=1

/locus-tag="Anl8g01710"
/number=1

/locus-tag="Anl8g01710"
/number=2

/locus-tag="Anl8g01720"
/locus-tag="Anl8g01720"



/locus-tag="Anl8g01720" 



/inference="profile:COGS:COG0477"
/note="unnamed protein product;
Function: CaMDR1 of C. albicans
the confers insensitivity to the
anti-mitotic drug, benomyl, and to
the dihydrofolate reductase
inhibitor, methotrexate. Remark:



exon 

intron 

exon 

intron 

exon 

intron 

gene 
mRNA 
CDS 



159975. .160316 

160317. .160366 

160367. .160732 

160733. .160781 

160782. .161738 

161739 . . 161795 

161796. .161849 

complement (<162686 . 
3666) 

complement (<162686 . 
3666) 

complement ( 162686 . . 
666) 



seven different mutant alleles of 

CaMDR1 from C. albicans showed

distinct drug resistance profiles. 

Similarity: the ORF shows 

similarity to several major 

facilitator proteins from 

different species. Title: strong 

similarity to benomyl/methotrexate 

resistance protein MDR1 - Candida

albicans plasma membrane" 

/citation=[13] 

/citation=[20] 

/codon-start=l 

/protein-id="CAK47244.1" 

/db-xref="GI:134084211"

/translation="MSCLIQDSVFGQFIRLVSRG 

RYFRFEEQKDPSLWQQYINLDKSG 

RMAYTGRPDADEVPEPERFSAADTESMTRVPSTA 

WSHNGRRYQGITGAAIDPEKGRDV 

TIIDWWDDHDPENPQHWPLWKKVWTFEICLLTF 

SVYIGSAIYSAGIESVMSEFQISQ 

VAATLGLTLFVAGYGLGPLLWSPMSEVPQIGRNS 

VYIATLIVFVALQVPAALAGNLGT 

LLAFRFLTGFFGSPALATGGASLADMFAPRKRAY 

GIGIWGISAIC GP VLGP L VGGF AA 

QAKGWRWTIWELMWLSGFTLWLIIFLPETSSSN 

ILYRRARRLRKLTNRSNLRSEPEL 

ASEHLTARELAMMTLVRPFTLNFTEPMVFLLNLY 

lALIYGLLYVWFESFTIVFSGIYG 

FNLGQQGLAY I G I LTGAL I T I PP YYWWMHKYLEP 

KFDPETGNVPPEARLPPAIVGGFF 

IPICLFWFGWSARPSIHWIMPIVGSGFFSVGAFL 

LFNPVLNYL S DAYPE YAAS VLAGN 

DLFRSAFGAGFPLFATAMYKNLGVDWASSTLAFL 

GCAFIPIPWLMKYGGTLRKKYSR HARKDI" 

/locus-tag="Anl8g01720"

/number=1

/locus-tag="Anl8g01720"
/number=1

/locus-tag="Anl8g01720"
/number=2

/locus-tag="Anl8g01720"

/number=2

/locus-tag="Anl8g01720"
/number=3

/locus-tag="Anl8g01720"
/number=3

/locus-tag="Anl8g01720"
/number=4
.>16 /locus-tag="Anl8g01730" 

.>16 /locus-tag="Anl8g01730" 

>163 /locus-tag="Anl8g01730" 

/note="unnamed protein product; 
Remark: the ORF is N-terminally 
truncated due to contig border. 
Title: strong similarity to 
hypothetical protein encoded by 
Anl2g00980 - Aspergillus niger 
[truncated ORF]"



/codon-start=1
/protein-id="CAK47245.1"
/db-xref="GI:134084212"
/translation="GLSAWYMPDDYSFSTFSMTP 
VSTYDAWYTNATLIAIQYMAPRA 
NLELLDDSPMTNGTQWEATECSLEPIVRSFRPTV 
YQNNYSDKTRAIWFQYNWTSTNNV 
HQSAGLNLTPDSWGSDLGVHKAPQSFILSGDAKQ 
WIYPFLTAIFSGQATRSGSEIYFD 
PLDKGWYAGFDTLQALAVANITGCSEILSDRLSC 
AMDNVAAAMSKTFRDVAYWNPAS 
AAMTTGRAKVSEVYIWYWAYLALPVLVWTLGAI 
LVFGTLWKTRRARVPAWRNDTLPL 
LFLYRDQSEGMESLQGSETVQASGHEMVRLYTDE 
SRVFLGRCTDTDSIRP" 
exon complement(162686. .>163 /locus-tag="Anl8g01730" 

666) 

/number=1

SEQUENCE (SEQ) : 

1 gcttatatgt atctgtcgct agcgcatcct ggtagcctaa atgaggggta caaggctgtc 
61 actaaggaca ccatatactg ctaagtcgca ctacacaact acaggagctc agaaaccatt 
121 acatggtagc ttgtgtgagg caacgtcgtt tgttggctgc agtagcgtgt ggctacgggg 
181 tatttcttgc tccacccaat acaaaggaaa gctgaacatc cacttaaatc tcacgtcagg 
241 cggacgagtc gtgactccga ttccagccgg gtttgagcac ggacaaggtc gcgatttgac 
301 ggatgaattg gccaatgaaa ccttggtaga ttagctactt agtattgcgc taattgctaa 
361 gctttctgtt cacttagcgt cgccgtccgg tcgacagcgc cgtggctcgc atctagcttg 
421 cattcagcgc atcgccagct gccagcggat gcaggattct attttttgaa ctggacttgt 
481 cggttttagg tagctctacc tcggtggcca atgcagctta cttgcagact cgatgcttgg 
541 tttaactgtg tcacttaata tctggatcgg cacgttgacg ctatattgca ttcatactac 
601 gttgagcaaa ccggaccaag aatatatatc ttgccatggg ccagatgggc atagtagcac 
661 gaactttatt acccgcgaat agtcgttcaa ttatatattc agtacattat aaacgctcca 
721 tacaagctac ataagcgagc taaaatggaa aatctaaggt tgacacacag attcgccgcc 
781 tatcaatccg gcctgtccgt aaagcagttc ttgcgaaacg aagctttaac ccttagctat 
841 aaggcaaaag gagctgcaac aaacatcaag tacggccggg tggcgcaatg gtagcgcgct 
901 gccttccggt agtggatgtg gcagtggttg cgtgttcgag tcacgtctcg gtcgtttctt 
961 ttttcttttg catctacact ccataccaga agtgatagtc ttagatttca ttttttattg 
1021 gtcgcaacta ttattttatc cccgcctccc ctctacttag tcccacccgc tttcattacg 
1081 gtttgcgctc gcacacaccc gatatccacc atggtatccc actcttcagt acatcgtaga 
1141 aagcaagggc cttacgagaa cagcacttga ctaaagtccg tctcgagtat cgtgtctctc 
1201 tggccattca caattttgca ttaggcgcga tctgagttga tcaagctacc gaggaaagaa 
1261 tcatgtctgc ggcctacaat tgacatatca gcagcagcac catgccatca tgctcagtac 
1321 ctacctgttc agcaacacca tatgtaggcg cctgataatg cgccgccaac gggagcggaa 
1381 taatgcccga gtctacaacc ctcagtcgct cagcctgata gactcgacac tcagtgtcca 
1441 caacttctcc catcgcacaa gtccctgagt agtgatacgt gctaccacca ctccgattcg 
1501 cccggacctc gatcgcctca ttcgaccagt tatcccggag ggaatcatcc acaccaaact 
1561 caacagcatt gagaccctcg acggcctgga tggctcgcgc agtctcccgc acattctcat 
1621 acaacgctga tcggtcaacc ggggacgcaa ggagtttggg atcgagcacg ggtgcgtcag 
1681 cagggtcaga tgagcgcagc gtgagagtac cccgtgacac tgagtcgacg agaacgtgca 
1741 tgaggctgat gacgcgctta ccgggccagc cttcgagggg tttcttgcga gcggacgtgt 
1801 cgacggcggt gtagctgact tgattagcag tgtcccttct tctgcgggcc tggaggaaca
1861 gcatacacga taaaattttc gagttgcatc ttcccgtcag ccaagtaccg acttcgctca 
1921 gttgggctca gtattttctt ggctatctct tgcgttttcg tgtctgcagg cgcaaaggac 
1981 atccaatcgt atggcaaacc cgctagccaa tggacttgat cggtttccat ctccccgtca 
2041 cctagggtgg cattacactc gtcgattgtc cagtaggtga atatatgcgg atggtcagca 
2101 tagttcttcc ccacggggag gtctgcgacc ggggtaatgt tatgtcgctt cagttctgct 
2161 gctgggccga tgccgctggt actgttatta ggctcctcac ttctcagaag cagctaatgc 
2221 taacccacct tagtaaaagt attttgggtg atccatatgc tcctgccgac agaatcacct 
2281 cctttcgagc cctgacctgg acacgctcaa tctgatctcc cactgctcgc gcaacggcga 
2341 cgccttccgc agtccttccg gccatgatga gtctctccac cgttgctttc gtccagagag 
2401 tgacattgct ccccaaaggg tagctactcg caaagctcct cagtcctgca taggaggaac 
2461 tgaaagcctc cgaatacccc agctgctgtc cactattcat atcgtctgcc tgcttgcgac 



2521 cgaggaactc gtgaaatctg cgactcttct cccgcaacgg gtaatttcgt ggcttgccac 

2581 tattcgtcgc atggctgacc tatacaatat caatacggtt cctcatgcca gtaaagtgcg 

2641 atagctacct tgatcggtcc acgatctcca tgagaatcaa tcgtgagaag acctctcttc 

2701 ttcatctcta cagaaggata ccacgtctcg acccttttga aatacggcag aagaccattc 

2761 cacgaccatc tccggtctta aactaaatct gcccattggt cactgtccca acaggtccat 

2821 atcagcaaca atctccggaa ccataatatc attcccgctc ctcgagagga actcactaat 

2881 caacagccgg cccccgcgta tacacctgat aattaacccc cgaccctccc ccgacaatct 

2941 tgccagaagc caaacaaacc gtatcacctc cgaaagcgcg ctcctgcggc accgagcgaa 

3001 gattccattc gagatctgag tcgaaccctg ccacgaagcc cattgcggcg gatgtccggt 

3061 catctttctc gatgcctgct tcaatgagca ggatgcgtaa gtcgggacga ctttgcgaga 

3121 ttcgacttgc taagacgcag cctgttgctt gtcaatgagg acgggcgtgg ttgttatggg 

3181 ggtattaccc gcaacgcctg ctccgactat gatgaagtcg aatgtttcct ctatgtgcat 

3241 gttgctgatg tgacatttgg tggataacta taactgcaag cgatggatga cagagactgt 

3301 atgccactga tgcaacgagc gagtaatata ctcgaagcta cgggtggcct gaaaagagta 

3361 agatactcag agctcgtccg tttgcggaga tgctggctag gtagattttg atggctgtta 

3421 cacgacgaga gtcagctgtt cgggaaggaa gctttgcatg ccatttgcta tccgcagaaa 

3481 accccacgca tcatctggtt gaataatgga catgaaacag acaagcatgt cgttacattc 

3541 aacctaaagg cactttggct ttaaggcgtg cactatgacc gcatgatggc tggtctggac 

3601 accagagaga gttggacgcg aaacctaatt tgcatgaggg gaaacggcct cgccgtcagg 

3661 acttcccgag tatcaacaat caagaaaaac acaatgagct aaagtctcta attccatcat 

3721 catatcaatc atcatcatct catctattct acatctaagt caaaacactc ctggcagaac 

3781 gacgaaccta tttccaggta tcctcggtaa ccgcccaggg acccgggggc accttatcat 

3841 tccactcgcc gcgcggaggg cgccacagca aaagaatagc gggcggttga cattcatgca 

3901 ccaacctgtt ccactcatca ctggggttct tcttcgcaaa gtcaatattc agactgatga 

3961 cgcgcttgta tccgctctct tcatacaacc agcgaccgat aggactgccc tcgatccagc 

4021 tctcatatcc caaggggtca atcctctcat gaccccattg catgaggagt cgccctacgc 

4081 ctcgacggcg gtaatcgggg tgaacgccca tagcatttac acctatacgc atcatcaata 

4141 aattagaccc cttagaaaaa agaaaatgat aacatagaac aaaggggggt aagggtatac 

4201 cagcatgagg cctctgcaac caacacattc tcggaaaaaa ggcctgactg agcatgcgac 

4261 tagcatactc ggcgcgttcg gagccctccg ggtaccaggt acacgataag ggttgagggc 

4321 cattaggaaa ggggttcttg tggaagatga gccattcgca gccaccggcc acgcgaccgg 

4381 tctcggtatc gatcacatag atgagatggg aggcaggatt cttctcgtgc gcggcccagc

4441 ggatttccgt gtcgagggct ttgtctttta cgcggtcttc tgcggtatgg ccgcttacgg 

4501 gatggacggc ggtgaatact gggctgtttg gtggtggtgc gaccgggcaa attagtttgg 

4561 atactactgt gcggagaggg gaggatagga ggatgggagg atgggaagga actaatgaag 

4621 gaagaaggaa gaaggaagga tggaaaggaa tgtgtggtgg taattgcgct cactcacttg 

4681 taagggttga agttggcgga ccagaggaca tcgactagac gggcatattc ctccttggtg 

4741 tggatttccc ggacttcgaa ctgcgacttg gtcatgatgg ggggagatgg ggagatagag 

4801 ggctggctgg actggactgg acttggactg gacttggact ggacttgggg aaggattgat 

4861 ggatggatga ttgattgaaa cggggaaatt gaggggcgcg cgggatcgag gggaacaaca 

4921 agaaaagaaa gcaagaaaga aaagtatttg acacacttga ctaatagata gtgaagggga 

4981 aataaattag ttaacctcac taaactacca gttaattaac taatttagta gccgaaagcc 

5041 gatgttaccc tgtcacggag tggacggact aaatgaatcc ctctttttag attaaaaaaa 

5101 gtagtataaa gtagactatt ttagggcgac actaccgtag ggaaaaagtc cctgatcaag 

5161 tagggaattg cgccatttcc cgagctgttt agttagtaaa aagcatccca actaaagctc 

5221 ggctagcgga ccctggtagt gcttccgcgc ccatcccaca tagtagatga tactgatgag 

5281 ggtaactccc gcgaagatga caatgctgta gttaatattc ttcggggcag ggtggttctc 

5341 cggtggccag cagccgaaga acatcatgat gatcgagtag gcaatggcaa aaatgttatt

5401 cgccgtgccc aaccaccctt tgacccggaa gggcccccag ttgaggacac cgccgccttg 

5461 acccggctcg gtaaggtcgc cgcgcaatct acgatagagc aggagagtta gagagatcat 

5521 gtatgaggca aagaaggcct ccagcaccag cgacaggatg atgttcagca ccactgccga 

5581 cccgatgtta atcaggctga gcagaccggc cgcgacaacc accaccaacg tgctgtggaa 

5641 gggcacgaga gtccgttggt tcacctgtag atatgacaga ccgcaattag aatcttccca 

5701 acaaaaacag cctgggggta gatgggggtt aggtccatca catacctgtc gcacccagcg 

5761 ccagccggga atggcctgct cgcgggagaa tgaccacagc atccgactgc cggaagaaat 

5821 ccccgcgagt cctgcgcacg cttgcagaat ggccatcata ctggagagaa ccacggcggc 

5881 acccttactg ccggtcgagc tcgccagcat gtacatgaag gggtacgagg cactggattc 

5941 cacaatcttc agcaggtctc cagcggtgaa gaggaaggca atcagaatgg caaagccagt 

6001 tgcaccgttg atcatgaggg ccaacagcat ggagcgggga acgacccggc tgctgttggc 

6061 gacttcttcg gccatctgca aggagttgag ttagtgactg actcggacag cggttaaaca 

6121 ccgagaagga agcacttaca tgaacagcgc catccgtgcc tagtgaacaa aaccgagtca 

6181 gtccattatg ttgcagtcga taagacgcgg gaagtctgcc ttaccaacaa atgcagccac 



6241 agtgcctttt aatccgataa aagtagccaa ggtgagatca ccccagccac cgtaatcgaa 

6301 ggatacagta aagatcgatc gcggatcagc cttgggcccc aaatagacca gcggaaccaa 

6361 gacggcgaag aagccaagca catggacaag aagaataagc acttcaatga ccggtaagat 

6421 catccccagc caagtattca gaactgtgca gacgagcaaa atcgcccaca tgatcagcgt 

6481 gatgtgccat cctttggggt catagctggg gatgttcatg gcaatggtgc tctgcagaac 

6541 actggcggcc aggtagatgg cggctgtggg ggcagccgtc cacgcaatga tgcagaccga 

6601 gcctggaatt tcctttgcgt tagccaattc cgaccgcggt tcgcatattc ggcacatgga 

6661 gttgcacaca taccagtgac atgactgaga aacttcctgg ccgacctggg ggccaagatg 

6721 ctcacccaat gatattgtcc gcccgcctgt ggtgagaaat taggctcatt ctcagctgca 

6781 gcaagtgtcg gacacatacc gaaggcatca tactggctag ctcccccatc gaaatgaaca 

6841 cggaaatgat gcctgcccat actccaagaa agccatacac cagaccggcg ggaccaccac 

6901 tgcacgaagg ttaacagaca tttttttctt tcagaggcaa gttcattccg tggcaggcct 

6961 tacttggtga gtcccaagcc aatgttccta atgatcctat tagcatgtac agatttgacg 

7021 ggataaaaag caagggggga agtggggaac cgggaggagg ggttaaagaa aaagaaaatc 

7081 aaataataag aaaatatcgc atacattata accgcttccc aagtcagcat caagctgcag 

7141 gcaaacccga cggtactcgt cagaccaaat cgtcgctagc aagggtagat ctcagcggct 

7201 gatgggagta aaatctcggg gtaaacacgc gtgttggtta ccttcaacac ctccttcttt 

7261 ccgaccctgg ccagctcgta agcatcgtcg ctgaaagtag ccttacttgg cgtgccagaa 

7321 tcccggttcg ttggagcacg cagctcgatc gaatgttcaa catcgctgtt cctggccatt 

7381 ttgggaactc gtcacgacct gtctaagacg caggaaacta gacaggtaag agaaggtgtg 

7441 atttcgaagg catggatttt aagaactgat tcggagtcat tttatgcaag cgcggccgga 

7501 taaaatccaa cgtagacggt taaccctgca ggattgccac acacactcct ggcaatagac 

7561 caattatcgg ccagactacc cgaagaagaa aagggtccct atccctatcg gctgcagaaa 

7621 gtatcgtgcg atgattgggg gccagatgca tcgcaagttg ggggccgtta tttcgatgtc 

7681 tgaaggatgg ctgagtgcga gtgaaggaaa ttgccatccc ggtgtttagc tagttagtct

7741 agcctatcga gaaatgttca actacgaaat tagacgatct tggctgccgc gtgtctcata 

7801 tgggcgtaaa ttgaaattgg aacgcgacca agcgactccg gagggaagaa ggaatgaagg 

7861 ctggcaagca gggggggagg gtggagttgt gtaggtggag aacaagacag ggagagggtg 

7921 tgcacaacca caaccacaat gggaccatcc aacgtcagcc ctagtctaca atagagtagt 

7981 agatacctgt ccatcttgtt cgggttggcc ggaaaagccc gcccaccacc atcaccaaca 

8041 cagcacaaac caagaagcaa ttcagccaag gcagctcggc tactcaagca gtcttggcca

8101 caagcagtaa gaccgagaca gccaaccgtc tcacatgtat tgactgctgt tgccccatcg 

8161 gggttagata ttcctgcaga ggccagggaa gtgcggccac taaggaatcc aggcggaggg 

8221 gcgcagagcc atagtgcgac gcttataggc tgatggcctt ggcgatcatc gggtctcccc 

8281 gactaaactt tccgggcggc acgcgccagc cccagtagta tccagtgtta tccagcgtat 

8341 cctgtctgtc cggtgtgtct ccagtgtctc cagtgcgacc cgtgctcctt ctccgcccgc 

8401 gtgcgtgcac agcatgtttg ctgctgggct gtgtatgggg ctgttcacgc actgctcaca 

8461 agtcggatct gcctcctctt cagtctggtc ttatttaact tatatgccta aaatcggtca 

8521 atcacgatca gcttcgtagc cagccaaaat ccttcactac ctgtgccttg gaatccgatt 

8581 cggtaccatg aaggttattt tgtttcaagc ctgaaaagcc atgacagacc attccgcagc 

8641 agtcaaggtc gtgacatgct tcctcctcat cgtctcgttc atcgcagtcg tagcctgttt 

8701 gacgacaaac tggcaggttc tgaggaggaa ggtgtctagt gtggcgctcc ttttgagcac 

8761 tttggttagc tagctctctc agataacgtg tcctcgtgcc ttttgtatag taaaactgac 

8821 tgccagtaga tcgcatcgat tgcctccggg gcagctgtat ccgtcgctgc aactcatggc 

8881 ttgggacagg cttcccccct cacagatgcc caggtggtcg tcatgcaaaa ggccctctac 

8941 tccatggagg tgctgtatgt tttgactctg ggactgggaa agctatctgt catggttctc 

9001 ttctacagtc tgctctcgtc gacaggccag tcgaagtccg tcctggcagc gacgggtcta 

9061 cttttaatat gggttgtggt aatggtcatt gtggtatgcc tgcaatgcca tccacccgaa 

9121 gtttggaaca tcgtcggcgg aacgtgtctt gacttggtaa gtaaaatgcg atgcgataac 

9181 cccaactaag aaaacattgc taatgtatac aagtccggga tttggatagc attcggggtc 

9241 atgaacgtct tggtcgagat catgattatc gcagttccgt ccttcatcat cttccgtctg

9301 aaactgagtc tgaaaagacg cttggtcgtc ataagctgct tcggcatccg gatcctgtat
9361 gtcgcaaagc cacaccgagt cttacacttc taacctcttg tctagggaca tcgcgggcag 
9421 tattgtacag ctatgctacg tccgtaattt caagatccac gccgactccc caatgccgac 
9481 caatgtttgg caatgggcaa tttgcagtca agtgcttcaa acagtggcta tcttatctgc 
9541 ctgtgtgcca taccttcgag agttcctgga gtcgttcccc agtggtatgt tcaagccgac 
9601 tgaactcaaa catcctacgg tgcagtcggc atacaatgcg accaaatgtt cggacagcga 
9661 tatagaatta atgaggccgg aatcaacaaa agatacttag tttaattaat tgacttgaat 
9721 ttcataacta aatacataaa tcttcgcgtt gtatcattga tcccttgtaa tatcccgctt 
9781 gaccatttac ttgctgatag cagcaacggt aagctcgtcc atctgacgga cgttggcgtc 
9841 agcagcggca cgctcgacgt tgttggtggt agggagggcg agagccacag ccgagagggc 
9901 gacgaggagg agagtagcgg cacgcattgt gtaagatgtg gatgtttgga gaagggttgt 



9961 ctcggaggag aagcaggagt ctgtcacaga gatttgttaa cccaaaggga ctcgtagaac 

10021 ttggtgtacc attaagacca ggcgagcata cctttgcttg ggaggattgg gagtgtgaga 

10081 cttggaagac ttgagaattg ctgctgaatg aacgaggagc gaggggggct tcgaggggat 

10141 tacataggaa ccagcagcgg aggaccggca aggtccgaag acaattgatc cgaggtggga 

10201 tagaagcagt ccgtcacata gtggagcctc agatagacag gattcccagt ttgcgacgct 

10261 actatcaacc ggtgtgccgt gctgaccagc cttgcattag cttcagctct ttcatgcaat 

10321 gtgaacccac atcgccgaca gccccattgc cggaatcgca tgatgtgttc cggagccgac 

10381 gttccatcag aattgtgaca aatgtgacaa gggctcggtg tttaaagtct gagcattgcg 

10441 ttgacccccg acgcccctcc ggctgccaaa cccctccatg ctccttcccc cctctcacaa 

10501 tgtattccta cgagaaagtc aaccgtgacg acgacagcga ccaggggctt ctggctgagg 

10561 actcggagag agtcgcacct cagcccagag tctctaggtt ccctagctgg acgaacgcct 

10621 tgctgctctt ggggctcttg ttttcgctat cgctgaacgt gggatggttt gtgttccaga 

10681 agggccacaa tgttcccatc caggacttgg atcctgatgc ggccatggca aaacagcgtt 

10741 ctccttacag tacgtactta tcgcaaaggt cactatatgg ggtttcatct aacattgttt 

10801 ccaatgacca gccggtctcg catgggatac tcacaagcct tactcccacc actccgagta 

10861 tactagcgag aacgcgaccc atgcggatga gatgtgggag agcctgagca tggaccccat 

10921 ggttattgct cctacttggg aatgggctca gagcaagggg ctgtcggatt cttgggcttt 

10981 cccttgggat tcgaaccgga ggatctactt catcaaggtg ttccatcaac tccactgtct 

11041 ggtgagtcaa attcgcagca ttcatagtca aaatggatgc tgacatttta cttttatcta 

11101 ctagaaactc atgcgccact cgtaccatga gctttggtcc ggacaagaga gctcgatccc 

11161 cgctcctcat atcgaacact gcctcgattc cctgcgacag gatctgatgt gcaaggccga 

11221 tgataccccc atgccatctc ttcagctgtt gaatggtggt ggagaaggac aacaaatgca 

11281 atgcaaggac ttcgataagc tggtagcctg gtcgaaggca ccggaacgca atgcttgcta 

11341 caagcgactg accgactaca agccgatcgt ccatagcatt gagcgctatg ccttctgtcc 

11401 tgaggatagt gagcactacc cgacgatgag caagtatttc gaggaacacg gacattatgc 

11461 cgatcctttt tcggaataga caagggcgag gctgatttga atcggttggg gagggtgaaa 

11521 agcagccaag aaaggccagc cgaggggagg aggggaaaag ggccttttag cgccgaaaac 

11581 cctagtcagc acctaattta gtctatttag cttcaatgca actaagaaag tagcaccgtt 

11641 tagtggccag taactctgca agcccccctg caagcccact gggggccaaa cggtaccacc 

11701 ttcttcttcc tgttagtcca tttgcgtggc ctgagtggac aaagtacttt atctgcttat 

11761 ctgcttactg tcaaaggagc atcaagcctg atcagcctcg gtctgagcgc aaacccttct 

11821 ctcgtccgat cttcccctcc attcctttcg tcccgtccct tttttttttt tttaatcaca 

11881 catcggctcc ctttcatcgc ctgtcttcct ttcctttcca ttaacttttc accaatcctt 

11941 ccaccttctt attggtggtt atttaacccg tccataagca tggatgagga agatggccat

12001 tgcaaagaga ccgaggtcac cgtccagtcg acgagccccg tggactcggc cgattatgat 

12061 cctttcgacc cccaaacccg ctccaccacg tataagtgga tcaccgtcgt ccttgtcgcg 

12121 ggattatcaa ccatggtgta agattgatct cccataattc tcaaggtcaa ggcttctgac 

12181 tgttgcgccc gggaaagtca gctgtccacc atcatagctg cccctgtctc cccgtccatc 

12241 ttggctcatt ttcattccga caatgccctc taccgcaccc tcattgtttc catctgggaa 

12301 ctcggtgaaa tcgtagcacc gttactatgg ggcccgcttt ctgagttgta tggccgacaa 

12361 tggcctctga atattgccaa cctcttcttc gtggccttct tggctggcac tgccgccagc 

12421 accagtatcc agatgctgat tgcattccgg ttcctgagtg gtgcggcgac tgcagcatca 

12481 gccatcggtc ctggcattgt cagcgatctc tttccagagg aaagtcgcgg acgtgccatg 

12541 tctattatgt ccttgaccgg tgcattggga ccagtagtag gccccatcat cggctcctat 

12601 ctgggtgaaa aggccggatg gcgatgggca ttctggttac cgaccattgc tacgggtact 

12661 ctgtccctgc tcattcttgt tgtgtaccgt gaaacgtaca gcgtcactct acagaaacgc 

12721 aaagctagac agcagaaccc cgagagtggt tccaagagtc ccgaggatac agacaagacg 

12781 gccagtcaag tatttttcaa agctatcctc cggccacttc gtctgctcat ccggtccccc 

12841 atgcttatac tagttacctt ctacctcagc gtggtctacg ggtacaccta cctggtgatg 

12901 acaaccatcg caccactgtt ccaggatgtc tacggcttct ccgaaggagc ttccggccta 

12961 gccttcctag gactgtgcct tgggcttatt ctgggggctt tcctttgcag ctttctgcta 

13021 gatcgctatg tacgcacagc cagggcacgg tccggcacct ctaaaccaga gcagcgcctt 

13081 cctcccgtac tgattgcatg cttcgttatg tcaggcggcc tttttctctt tggttggact 

13141 gcacagtatc atgtccaatg gattgcacca attattggta cgggcattat tgggtttgga 

13201 ctagtgtcta ccaccatcac cttgcagact tacgtcgtgg acctctttgg catctacgcc 

13261 gcgagtgcta ccagcgcaat gctggtgccc agaaatgcct gcgccgcatt cttgcccctt 

13321 gcagggccac cattgtttga ccgccttgga tacaactggg gcgggactct actggcgctg 

13381 atcgtattgg tgttctcgct catgccccta atctttatca attatggcga gagattgcgg

13441 ggaaagaatc tacttgatga ttgagatgtc gatggaattt gtatagtacg tttagtatga 

13501 caaaagtatt aacatgatgg taatgaaatt acttgggaat gttctcattt tgaagaagta 

13561 tctgctgcgc atacactact ctccagggct ggatccgacg cacgactagt atatacccgt 

13621 ctaggaaggc cttgcgggga ccgatctcag atggacagaa tgactggccc tgaatgctaa 



13681 atgctataca ggcaacaaca cctcgatctg aatatactag tatccagctt gtttaacgca 

13741 tgatatcata cgtacttgat gtcctacaaa ggtcgatggc cttcccgtcc cgtcctaaga 

13801 aacttaatct ggctctgtgg atccttgtgg gcctacgaga tctggagcgc cctggctgca 

13861 ttcttcatca acatcaggga aagaagagac aattgaagct gtgatgtggg ctccgtccat 

13921 aaagttgaac atgaagacaa attgaataca ttaaattcca aacgtaatca ttgaaattga 

13981 acactcgcaa aattcgcttc cttcagacat catcggcagt tcaaggttct actgggacac 

14041 cagcgtcgcc agatccgtat cgatggagcg gtagaacccc agggccgaga tcataggctt 

14101 atcgtcgaag cggtaaaggt acacggtctc cccctcagcc gcaatatgcc gatacttgta 

14161 ccaggacggc acgcagaatg tatcccctgt ctcccataca agagtcttgt ccccgatttc 

14221 cgagcatccg cttcccgcaa taacatggta gaccgcggag gttgtctctt gccggggtag 

14281 ggaagaggta ccagcgtcga cccgttcggc actaccgcct agaacacgac tcactacaca 

14341 ttgagcctgt cagccaaagg cgtacctatg ttatattgca aagcagaacc aacaggcggg 

14401 ctaatcatac cttcgctacc atcagcccta agatatcgtc tcacagccca ggtacccggg 

14461 gccttatcca gctccgcctt catcttgtcc cacgggaata caatgggcga ggtcgcgcta 

14521 tcaacatcct ccgccgggta ccgcggctga tcatagtgat cgacaaaatg cacagggaag 

14581 tggcggaaat taggtagatc cagaccgtcc agccagatca taggcccggt tccgtccttc 

14641 ccgtggtcat ggtagttcca cgtcggggtg agaatcacgt cgcctttctg catcttgatc 

14701 ctcttgccat gcacagcagt aaacccccca ttgccttcga taatataccg cattgcgaac 

14761 gcggtatgac ggtgggcggg ggctgtctca ttgggcatta cgagttggag acctgcgtag 

14821 agcgtatcgg ttgtgtatgg cgcatctagt cgtaagagaa taaggtgaaa ttcagcgtca 

14881 ttgtcaaact gtgcattggg aaggaatact atatacctct agcggggttg acgagcatga 

14941 gcactcttct ctcagcctgt ttctcggtga cgagttcgcc ggcgcgaagc aagtttggac 

15001 ggatgctatc atatctccaa atgaagggta cagctgttgg attaggctct ggcgggttga 

15061 gtctggtcat ctgcgtccat agtggcagag tattgctctt cttcaagtct tgtagaagct 

15121 gctctgcgct gtctgcgccg ttcgacccgg tattgcaagc tgatggagac ataatgggag 

15181 tgcgaaagtg tcgagatcta gcagttcaag gtattctgga ttgattgatt tcaagcatgg 

15241 cccatccgta ttagtatatg tacgtcgatt tagaagcgac atcaccatgt tccgtaactc 

15301 gcgccttcga tcgtgccggg gcggggaaag atcggaaagg ggacggttga cggggaagga 

15361 tcgggaacgc cggaatagta acgtgttttg gaccttttcc cagctgtaaa caaccaatgt 

15421 tttgctggtt taatagatcg tgtcgttact accctactta ggtctgatta ctgtgttcac 

15481 acaatagata tttgtctttc cgtcctgcaa gagggaccac acgtgcgcga caccgtgtac 

15541 aacggattga gcgagagtgt caaagttgaa ggccaccgta tctacggcag acctcaatac 

15601 cacgaatatt ggaatggctt ccacgcggat gagtgcgcag agcacgattg cggacctcag 

15661 ggttagacat cctactttgg ctgctagaaa ttgcacacct cgcctgtcta acaaaactcc 

15721 aggtcccgga ctaccccagc aactgcctcc aattcacgca gcagctcacg gcatctgcca 

15781 cgcatactct tccacagaaa gatgcgtctg ttcgtctgaa tatcattctc gctggtgctg 

15841 gattgggagg tctcgcaaca gcaattgcac tcgcccaagc cggccacgcg gtaaagatct 

15901 acgaacagac tcccgtcctc ggagaagtgg gcgctggtat tcaaatccca tccaactcaa 

15961 cacgcatcct gttcagctta ggactacagt cgtacttaga gccatacgta acggcgcctg 

16021 aatcgatctc gtttcggagg tggcagaacg gcaaggtcat cggcctgacg aagctcatcc 

16081 caaatttcgt caacaatttc aaagctccgt actacgtgat ccaccgggcc gacttccatt 

16141 cagctctatg ccaaaaggct cttgatgtcg gcgtggaaat agaactgggc gcgaaagttg 

16201 tcgattatga tccgattgtc ggcagcatca cgcttgccga cgggacgaag cactcggcag 

16261 acctcattgt tgcggcggat ggtttgtttc tcgactttta ggataaaaca gctgatacca 

16321 cggccgagac cgctaactgt taccaggtat aaaatctgtc gctcgaaatg ttgttctaca 

16381 aggggatgag atgcgtttcc agggacctgg tttcgctgcc taccgtgcgg tcgtcgatgt 

16441 gggtaagatg agaagggacc ctgacctgtc atggattctt gaaaagcctg ctctgaatat 

16501 atggtattcc tttctctccc tctcaaacag agtccctggt atcatgtttt gcttattgtt 

16561 gctgcgctaa atgcgcattc taggattgga gactccaggc acgtcatgac atacacgatc 

16621 ggcgctggga aggctttcaa tatggtccta tcacatccgg aaatgaccga ccccggtact 

16681 tggaaaccgg agacggcact cgaagacatg aaagcggaat ttcagggctg ggatccaatg

16741 taggtgacct accacttggt tcgatccgaa gcgccctcag cactctaaac tcaattgggt 

16801 agactctcaa aaatcatcgg catggttgag aaaaccgtca aatggccttt gctcaccggg 

16861 acgttactcc aaaactggac cgttggaaag ctcgttatcc tcggcgatgc tgcccatgct 

16921 atggtaccat acatgtcgca aggttggttg agccattcat tgcccattgc tatcgtcgat 

16981 gaagttacca atatacagat gcataccgga agctaaccct ttttactacc caggcgccgc 

17041 aatggcagta gaagacggca tagccctgtc ccgctccttg tcacacatga cctccagaga 

17101 ccagttgcag aaagctctga gtattttcca ggaagtgcga aagaaacgcg ccggccatat 

17161 gcaggaggcg agcctgctga acgggaaatt gtggcatttc ccagacggat ccttgcagca 

17221 agcgagagac gaggcgatgg cgcctgaagt gcaggggatc cccttcagtc atagtccaaa 

17281 ccagtggagc gatccggcga cgcagatgtg gtgctatgga tatgatgctg aggaggcgat 

17341 cgatattgct tggatggaga gtctggaggc gcgggttgat tgtgttcatt aggaatctag 



17401 atttgcgcag gctgttatgc agaagaatca aatttacata cttaggttct ctatgaagat 

17461 tgctactctt tggcaaatta gcgattgtat cttgtcgttg ggtgtataat atggggcaac

17521 gcaaactttt gacatactaa ctagttactg taccggaatg ttttggctat ttttggccac 

17581 accgactgtg gaaccggggt aagtagatca cgtgatttgg ccgccttctc cggatttccc 

17641 actctttccc cgaccgttcc caagtactct ccgcatctac gcttcatcgg cgctccagac 

17701 tcagagacgc gttggagtgg catgatcctc attttcgatc tgctactttt acccgctaga 

17761 ccaaaatgga ttttgctccc gagctgccga ctccgcaggg ccagtcagtc aaacgccgtc 

17821 gtgtggctct agcctgcgat gcatgcagaa ctagaaaatc aagggtatga tttcttcaga 

17881 cttcaaataa cttgtcatac gttatctctt gtagttgtaa ctctttaaac tatggcagtg 

17941 cgatggccgc cgccccaaat gtggaatgtg cgaagaccta ggcttcgagt gtgcctatac 

18001 gccgtcgaca actaccacaa atataatcgt gcaaaaagag tacgtcaata atctgaccat 

18061 gaccgacatc atgccatacc cctcacgaat ttggctaatt tgttcagtta tctgagagac 

18121 cttgaagagc gcgttaaatc gctagaaacc agcatggcca ccgtcaagtc tgatctctct 

18181 ggctttgcgt cgcaaattaa tggaagcatg cctgatgccc ctaggaagga gaatcccttg 

18241 agtaaccaga gtgaacgatt ttccgacttt gctggcaccg aagacacaat cgatgccatg 

18301 ggcgctgttg catttgccga cgaggaagat tgtgggttct ttggtacgtt ccctatcctt 

18361 atagaaatat agacaaatac agctcatacc ggtttaggtc cctcatcgaa cattgccttc 

18421 ctccgccatc tctcatgtgc agtggcacat agcgcaagcg cgcagaagga aatcacctct 

18481 cctccgttgg atcgcgtcgc ttacgacggg ggctttgtca gtgctacacg accttcatct 

18541 ccttctcatg atcagcggcc cgagagtctc gagggcgaga agtttgataa attcgcgctg 

18601 ccgcctcccg aggagtcatt ggcgctcatc catcgttatt ttgccgatac tgggcttctc 

18661 ttcccttata tccatccacc tacattcttc gagacctacg ctgaactcaa ggacaattca 

18721 aagcgggtcc gacgcacatg gcttggcttg ctgaacataa tacttgccat ggcgaagcta 

18781 acagctgttt cggggacgac atctgcagag acgggaatta gcgagtcagc tatttattat 

18841 caccgggcgc tgagcctctg caagggagag attttacgcg gcacgacact agaagttggt 

18901 aggtacagca tctatgctcc ggttagggct cgagtgcccg tgtatttgcg tgccactaag 

18961 atcaacacag tgcagtatct cctggtcatg ggccagtatc ttcaagggac gcagaagtct 

19021 gttcaggcgt ggacaatcca tgggttagcc gtgaaggctg cacttcaact cgggcttcat 

19081 tcaaagggcg cctcgagggc tttcaccccg ctggaacaag aggtcagaaa gcgaacgtgg 

19141 tttggatgtg tcgtgttaga tcggtaagta cctttcgcgg tctccgtaat aagatctatt 

19201 cccttctaat tgcttcgtgt caggtccttg agcatgactt ttggacgccc ggcggcaatt
19261 ccggactgct atgtgaaact cgacctacct gcattccaag gacgcagcga ggttctttcg 
19321 ataggcaacg aaaccgatgc ccataacagc attaggttct tcaacgccac agtgttcgtg 

19381 ctaatctttt cctcctcaac cggaaaacaa ttcagcaggc taacatcagg gtcagtacac
19441 tgtacaaaca aatcgctaac attattgatc gaatatatgg gcaaaatctt ggatgcgatt 
19501 ctcccttatc tataggtgaa actgtcggcc gtgtcctcgg catagaaaac caactcttct 
19561 cctgggtatt aggactcccg gagagtctcc gccaagtgac ggtccaaagc atgcgagagg 
19621 agattgagcg ctcagaagta ggagacgagg gccatcacaa gctctatccg ctgaagtttc 
19681 gtatcatttt aacactgcga tatttccacg ttcaaatact cctacatagg cccatccttg 
19741 tcaagttcct agacgcaaca gggccttccg gactcgaagc agacgaggtc aagctcctga 
19801 atgacatcgg ctacagtagc atgaacaaat gtgtcgactc ggcaatgggg atcatcgaca 
19861 tcatccatga acttgtttcg actactggat ggcagaaaga ccttcttggc gcttggtggt 
19921 actcgttata ctatagtacg tcacctaccc tatatctgat tgattgccgt gttcttttct 
19981 tctctttctg gctgctggct ctaaattcac ttccctcaac taatactgaa ccacagcatt 
20041 caatgccgcc ctggttatca ttggggccat gtgggtccaa cgtaacaggg acacgtctga 
20101 ggatccagtc gacaatggag ggaacaacaa caacatggat atctacctaa gccgcgctgt 
20161 tttaacattg cagaacctta acgttggaaa ccgcatggta gaccgctgca aatactacct 
20221 ggaccagcta atcactttgc tgggtctaca aggtaagatt tttcattcca tccactcaaa 
20281 aaagaaaata aacacagcgc taatctgagt gacacaataa acagcggata gaatcgcgga 
20341 gaccgggatc atctcgggca tgaacaatcc ggtggcagga aattcaacca ccggcttcga 
20401 ttttccttcg cttggtatcg aatgcgggga gtttatgata gatgacttct ttccaggctt
20461 tgcccggggg tcggggtttg aacggtggta gggagggggc gccccgggtg gggggattgg
20521 atttcaaacc agtttttctc tgctcccagc tccttttatg atagggcgca tgcaaaaaag 
20581 ttatgtgtga aaatagctaa gacctgagga tgaaacgata tcggggaagg tcccggcgag 
20641 gttttagact cgcctagata ccaagcaagg ggacaagcaa cgaaagaaat cttaagaccg 
20701 caattggtta actatgtaag ttaggtactc attacccccg cctcatgcaa tccacaaaac 
20761 tactcctccc tactaaagac gtgcggcaga taacggaaca tcactcccat ccacccaact 
20821 cccatgcaac cccggcctca gtcgcacagg aaggagaatc ttcgccctcg cccgcgtgaa 
20881 atcagccgta tcgagcaaat gcaactcact cgacatcgag tcatagttgt tcaccagcgc
20941 caggagatac ccatcccctt cagcagccaa ctcactacgg gggataaaga cgggctcttg 
21001 aaccatatgc gtcttcccag ggaaatacac ctccgtttcc cccgtaacat tatcaaaacg 
21061 cgcaagcgag ttgtacagcg ggtatccacc gccgagcaca ggtgcaatcc gcgcaaagtc 



21121 tgtgccgagg ctcgggtcca tcatgtcgaa gaagacatgt cgatgaggct tcgtggcgta 

21181 gcgatcatcg atgcggtaga actctgagtt gtccgcctga aggatgcgtg gttcggtgag 

21241 gttcagatct tcggctcgtg ggttgatcgt gaagcggaca agttgggaac ggatcgagct 

21301 gggctctggt gcgtttcctt gcgcgtcggg ccaccagaag aaaacattct tctcgcttag 

21361 cccgagatcg atgatgaggt tgccgctcga gtcttcatgt gcgtttgcgg tgtgccccgg 

21421 gaatgaattc ttgtactgga accactgcac ccaggagtat tagcaggcgc gatgaaatta 

21481 aagcggctgc ggatacctac cttgacatcc gcagtctttg ctccgcggcg cggtatgacg 

21541 cctaaataca gcggcgtttc ggggctccat tgccaatgct cgccgccttg cttcatacgt 

21601 tcgatgtcac acaactgcgg gatgattggg aagataacct gtttggatat tagcattgct 

21661 tccacagtcc tgtctagtgg aacgagggag attgattgct gaaacgcacc cagttgtccg 

21721 tcaccgcgaa atcatgaatc atcgcgacaa caggtgcgac cagccaaacg acctcggtaa 

21781 acttcccatc ggggcccaca gtgtagtagc agacgtcagg ggtcccatcg ccccgagctt 

21841 cgtaaccaaa gcagatcatc tcgcccgtct ccggatcgaa ctttggatgc gcagtaaagg 

21901 tcatactagg tagctggccg tcaaagtcgt agagacccct tgtctcgagt gtgaccggat

21961 ccatcgcgta ggggggcgaa tcctccttca gcgcgaggag ttgcccgttg aagtacacga 

22021 tgtttgtatt agcggtgctg cggactttga actcgacggc atcggtaaat ttgttgcggt 

22081 atttgccttc aatagaccag aggaataggg tcagctgtgc tgtgcacacc ctccgtggct 

22141 agcttgctca acatgatata gaaaggcgga gctaccaacc tatcagcgct cgttgagcct 

22201 gtcgctcccg gacaaacttc tctgtcctca catagcgttg ctggaaggag actctgccgt 

22261 catggaatcg aaaggcagtg atgttaccgt caccgttgaa ccactgtatc gtcattggct 

22321 tactagttct cacacgcgca ggagatgaat gttggatcac ttcactcaca ggatcgttct 

22381 cgataaacgg gggtagctga gggtcaggca tcaccctgta gaagacacca tcaatgtctg 

22441 tagggaggtc tcccaagacc tccagattca gggtttcgcc ctcgacacgg cacggcttca 

22501 taaagccgga gaactggggg cgagctggaa aggccgtggt tctagtttca gtcatggctg 

22561 caatatgtgt atttcctact aaaggaacag aactagggcg tgagatgtcg ctggatcgtt 

22621 ggaccaagag gtatgcgttg tgctgcagaa gactagactc gaatccgcct ttatatcagg 

22681 gcctgagata gaaaccacca cttctcctca gggtttgacc atcgtcacta aaagtggacc 

22741 gtaacttcgg cactggtgga aagccgcaga gtaatgatcc gccgtgacta cctggtcgtg 

22801 aagccaatga agactaagaa ttcccccgcc tctcgggctt tgggtcatcc gttgcaaaag 

22861 gtggaccgca acttcggcac tggtggaaag ccgctgagta ataatccgcc atgactacgt 

22921 ccgcctttcc gatgctcccc cgattagggc aaaattgacc cgaccactcc ccgtgctaac 

22981 cccggcgctg gaactattat gcgttgaggg cggccaccac gatttacttg atagtcacca

23041 agcctgtttc tccctctggt tcctataagc tgtcatacct taaagaacgc ctgttatgtc 

23101 tctcccctgg aagcgtctga ttcgctttca agcctcggat ggccgcattc tgcgcggcga 

23161 gcctattatc ccggcagcgg gagaaaccga cctagggctg atcaccgagg tcgatggtct 

23221 tcaagccaaa ttgcttagcg gaagcgacat ttacgatgtg accggcaaga cccaggtgac 

23281 agacgagatc gtgaccgtga agcgcatcct ctcgccactc gcacagagtg atgtacccat 

23341 catgcgatgc gtgggtctca attacgccaa gcatagtatg gacgactttc tcctcggatc 

23401 tatccaagtg cctatgtacc aacctgacat gaacttgaac agtccgagaa gccgggcgca 

23461 agcccccacc gtttccgttt attttcttca agccgaatac ctgcgttcat gatcacggcg 

23521 agccggttgt gataccaagg atcgcacaga atgaccaggc ggactacgaa ggtgagctgg 

23581 taggtatctc tgagagcctc ctgaccattg agtatgtcgg aatgccgttt cttactcagt 

23641 acttttctac tatttttagt gcatagtaat cggaaaagat gccaaagacg tcccagtcga 

23701 gtccgcgctc gactacatcg cagcctacac agttggaaat gacatttcct cccggaagct 

23761 tcagcgtgat cccgcgtatg ctggtgtcgt ccctcaatgg gggttctcga aaggcttcga 

23821 tacatttgcg ccgctggggc ccgttctagt tgctgcctcc gaaattcctg atccatctca 

23881 attacaactg acaacgatcg tggatggaga ggtgaggcag caagaatctg tcggggatct 

23941 tctgtttgat tgcaggtact tgatctcgta cttgtctcaa gggactacac tgcagaaggg 

24001 gagtgtcatt atgacgggaa ctcctggagg taaggggaaa ttctactgct aattatctat 

24061 gttgaaacat gagtaagagc attactgact gggactcagg tgtcggcgct ggatttacgc 

24121 ctccaaagtg gttggtgcct gggactcaaa tggatgtgca tatttcgggt atagggacgt

24181 tgcggaacaa tgttgtcttt gcgtagaacg ggtcgaactg ttgacgataa agaatctgat 

24241 agagacaacg cattttcatg gaaataatac cagtatatat tccacgtttg tcgcgcacca 

24301 tgtgcggaat tatcaaaaca tcatttatcg atctcaatta caacatttgg accggttcca 

24361 agctcaagtt cctcgcgatg ttgtaccaaa gggttctatg gaagtgctgc gaaatctcat 

24421 tactgcttca aatcgattta aggaggacac cgagctagca atcgtcgaga caacgtcaac 

24481 agtacggctc atacatcact aactgtagtc ggactgatac gatgcacatt tctacagcgc 

24541 aaaacccggt ctcgtcggtg gcttcaagtc cctcaagagt atagatccaa gaagccattt 

24601 atgcctcgca ctatcccagc gacctcaaag cgaggactgg tcttggtata ccactcccta 

24661 cccccattgc tcatagacgc gattgccaga tcatcggcct cgaatacaga cttcatagta 

24721 tcctggcttg ccctcccata gccttcatta gtaattccaa ttatcgtccc gtcatccgct 

24781 tgtatggtat atcgaacgta gacatggatc accccgtccg gacggaccgc gttccagtcc 



24841 ccgccgctgt tcgccagtat cttgccgttg agttttgggc ccacgacgac tcgtcctccg 

24901 gtgatgggga tgaatttcga tcgcccgaga acgtcgcgcc aaagccgacc gccggggcta 

24961 ggtcgaccta tgagttaagg aatatgcgag gagtggagga gttggcatca gaaaataagg 

25021 acgagacgag aagcggccgc cgcaatctgg gaggaaaggt cagcaactga cgaggagaaa 

25081 aaggaaaacg gtcgtgatgt ctctccaaga aggcttggct tctatccttt agctagtgct 

25141 actcctccat tgctgcgcgg cctagcaaca ttttcgggga ggcggggaaa gatcggtaga 

25201 cttagcttta cgccctccgc gccggggtac ccccggatcc ccgcagtctc agctcgccca 

25261 cggagtatca tgcagttctt ggctcgtgcc tagagtacgt agtacgtacg ccagttagca 

25321 cttgagtctc caatcgaaat tgtcgcgccc atctcccccc tttggcatac aaccatcccc 

25381 cctttttgtg tatggttgct ggtactagcg acgttgcctt aagatctgac ctagtctgct 

25441 ctcgacaggg gtccgcttct ataccattct tccctaccct atctgtctct ttctcttgac 

25501 tgaccaccac ggcaacctga acgatgccta ccctgagcca tcggggtctt caattgatcc 

25561 cagtccctac ggatgatccg ggtgatcctc ttcgctggcc cagctggctc aaattgtctg 

25621 tcatactctc gacgtccttg gtcaattttg tctccaacat gggtggtgct ggattgtccg 

25681 ttgccgtccc ggtcttgatg cagcagctcc agcgttccca agcagaggta acccagctac 

25741 tcacggtacg caattcatat aacctggaaa cccacgggta taagcagagt ctgctgactt 

25801 attctgggta gcttaacttt ctgttcctgg gcatcggcaa tatcttctgg gtcccgcttg 

25861 cggtcaagtt tggcaaacgt gcatcaatta tatcctcgat gttgcttcag gcaggggccc 

25921 tggtgtggtg tgcggtagct acgggctatg acagcctctt gggcgcaaga tgcgttcttg 

25981 gatttgctgc tgctgctggc gaagtaagaa gccgctccca taaagattca cctagtctta 

26041 tcacctggat acgccgaatt gacgttggat gttctataga gtatcgtacc cgagatcgtt 

26101 gctgatatat tcttcgtcca cgagcgtgct acgatgatgg caatgtgaga ttttggaccc 

26161 tttcatagta caccacctgt tgagactttt cctgatttga tatagctatg ttgtcttgat 

26221 ctctggaggc tcagctgttg gccccttaat tggaggtttt atggtagagt acgcctcggg 

26281 cacgtggcgg tcatacatgt ggctttgctt cgcgctcgcc agtttcaata tagtgctctt 

26341 ggtcttcctc ttcccggaat cgaccttcaa gcgaccggaa ccttctcagg cacaggagtt 

26401 acagctcgtc gcagaacaca aagagactac gagttcattc atcgagaatg cctcagctgc 

26461 agctccagag gactcctata ccaccaacac gccccccttt aagggtgtcc ttcggccgtt 

26521 ccattacaat aaggatgcca gctttttcaa ggcagtgatc tcgcccctca agctgttggt 

26581 tcatccgtcg gtctgttggg gcatttttac ctacggcata tctctaagtc cgcaaatcat 

26641 tatgatgtac gtgctcctat gagaaactct gtagcggaaa aagaccagcg cccgtgctaa

26701 taacagacaa acttataata gctttaattt ctccgctctc ctagaagcgt ctccatactt

26761 cttctgctcc gagtatgtcg gtctcatgga aatcggtgct attgtgggat tcctactagc

26821 atgctacggc ggtggctacc tatccgatat cgtaaacaga atgattgccc aacgcagcag
26881 cagtagcagc atccgccccg aacaacgcct catctctctc attccaggaa tggccatcgg 
26941 gccggcaggc tgtatcctgc tcgcgtttgc ctgcggccac aaactccact gggccgccat 
27001 cgcggttggc ttcggaatgg tctcgtttgg gaccgtgtat acgcccaaca tcgctcttac 
27061 gtacatcgtg cacttgcatc aacaagatgc cgcgaactgt cttgtctcaa ttagcttggt 
27121 caaaaacctg gtggcattta cgtttctgta tgttgctgtg gactgggttg agagggaggg 
27181 ttacgtgcag gtttttatga ttatgtttat gttgaatgtt ttggctcttc tctgtgcggt 
27241 cccgttgtat tttctgggac ggaggaagga cttgaacgta tataatggga tggaggggcg 
27301 tgcgtaagtc agatgtggag agatatacat cttgcggttc tttcaattgc accggtctta 
27361 tgtgtaaaat tgctgcacca tagaccgtgt tattgctaat agaattactc acaaacccac 
27421 agctaagctt ctcgtaggag gctaatctgt ccatacgaag catacaagct gtagagtaag 
27481 ataacatgcg tgcacaacat tgtttgtcga gtgtaaggcc cggcgatttg agtatcataa 
27541 aagggcactt agatagatcg gtacgacact attaaacggc agcctcattt gctcgtttac 
27601 atcctcctag ccgctcaaaa aagcgtgtgt tagtttgcct ccctttcctg agctttgagt 
27661 aaagaaaaca gtatgcagcg tgagatggtg gtcagccaga taagtaataa acaaagtgtc 

27721 atgctttgtg gcagggtttg gtgtgccaga ggctggtagg ggcggtgata tgccgattta
27781 tgtagcacct agccttctaa aacaccccat actttaatag gagggcccag cacatcgtct 
27841 gggtgcgaaa tgatttactc ccttctgcgg ggaaatccgc ctttgccgcg acgcggaata 
27901 tccccccagg gtatgtggac tgtaaagact catgaacatt aagactctgg tccttcttcc
27961 gcacggtcgc gaacgaggct gtgtccttca ggcatcgata caggaacagc cgcaatagat 
28021 actcttcttg gactcgagga taataatatg cccggagcat aacacaacag caatgcttgt 
28081 tggacgacgc aaaattgagt tcctcgtagg cagcaggatt gtgtttccgt aggtggccgt 
28141 aacgccccgc ctccaagctt ttaacaggtc caggccaact cagggttctt ccgattgacc 
28201 acggatttat tatccactgg ctgtgcaatt atagaccaat ggaatgtcgt gttgatacct 
28261 gttatttgac actaagatga catgtcgatc gaagcttgtg ttggtacccg aaacagtatg 
28321 gtttgctggt accgtaggag ggctgtgaaa tgcctcaatc cctaaatagt tattattcga 
28381 tgaggtcata gcactagcta tgcagatctg ctcagcccaa gtggatttag ttgatcgtgg 
28441 cgtctttact gtggatgcga gacaacgaat cgcagactgt aaaaacccgt ggctggtgaa 
28501 ctgtccgcat tactagtatg gatgagagga acagcctgcg cagctaaagc ctgtagaaca 



28561 agaaactccg aaagaagaag tatctactgt gcatatacta ctctccaggg ctgaatccga 

28621 cgcacgacta ctatataccc gtttaggaag gccttgcggg gacccatcac agctggacag 

28681 aatggctggc cttgaatgct atatgcaata caggcaacaa cacctcgatc tgaatatagt 

28741 atccagaggc ttgtttaacg catgatgtca tataattaaa tatatatata taatgatcgg 

28801 taccttccag ccataacacc aacttcagca gcaacaattt caattctctt gcagtcccaa 

28861 aagtctcgtt gcaatgctgt cgcttttaat atcagcagca gctgccactc tcgcatctgc 

28921 cctggaactt ccccagggtt attccccgga tcctgtctct tgcccaacaa atctttcatg 

28981 gatccgaccg gcagttggac tcagcagaga tgaagcgcaa tgggttgaag ggcggaagaa 

29041 tgtcatcctg ggctcattag acgcatattt gaaacgactc aacctggacg acttcgacac 

29101 agacgaatac atatcgcgtc tcaacaacac cagtcagacc ccaatcatgg gaatggccat 

29161 cagtggagga ggtttcggat ccgcctacac cgggactggt ctaatccgtg ctttggatga 

29221 ccgtcttccc gcagccaacg agcaacgtac cggtggactt ctacaaagca tgacctatct 

29281 gtctggtttg tctggaggat cctggcccgc agtgtccttc ccatcataca actttcccac 

29341 cgcagacgag attgtcgatt actggaaacc ggagattgac cgattcttca cggtcacgaa 

29401 cacctctgct gaagctgcta ctggaaaggc catctttgaa cagattgcta cgaagtacct 

29461 ggctggcttc gaggtagcgc taagtgatta tctaggacga ggatttgcgt acgagttcat 

29521 tcccggacaa tccggcggcc taaacaccac gttttcgggg atccggaatc taagcaattt 

29581 tatcaatcat caaatgccga tgcctatcat tcatctggct tcagttgaac cggaagatgc 

29641 agagtactac gaccttttgg tgccgtcatc taatggaacg attgtaagta gtgcttcttc 

29701 tcgataaact accagctcca gctaacgcgg tctagtttga tttgactccc ttcgaattcg 

29761 gcgcctggga cggagacgtg catgcattta cacccactga gtggctcgga aaccaactat

29821 ccaacggtat tcccgtaaac cagagcaaat gctggaaagg atttgatcga tcctcgtaag 

29881 taacagtatg gccccagcct cgcacgcttc taacttcatc ccagacttgt catcggcacc 

29941 tccgccgacg ccttcaactt ctggtacctc gaaagcgtct ccaacggaac ccttggccaa 

30001 tttgccaaac gctccaccac tcacgagtcc tctctcacca aacgattgtc ccaacctgcc 

30061 aacctgaacg cactcgttga cgccttccaa gagacctttg atctaaacct aacccaaatc 

30121 tcctactcga aattccccaa cccattcacc aacctatccc tctctaccgg taatacccac 

30181 aaatcctcaa ccctaaacct cgtcgacggc agcgaaacag gccaaacaat ccccctctgg 

30241 ggccagatcc agcccgcgcg caacgtcgac ttcatcatag cctgggacga ctcccaagac 

30301 gcagacccct acagctggaa caacggcaca aacctctaca acacgtacct cgccgccaac

30361 gcaacagggc ttccctttcc gataatccca ccctccagaa caatgatgaa cctgaattac 

30421 actctccatc cacaattctt cggctgcgac gccaacctca ccaccacagg cgacgaccgc

30481 gcacctatcg tgctgtatat ggctaatgcg ccgtatagcg catacacgaa cttctcgttc

30541 tggcagacgg agacgagtcg gcagcaaatg ggggagatat tcgtgaatag ttttgatatt

30601 gttacgcaag cgaatgggtc gtgggatggg gagtgggcgg agtgtatggg gtgtgcggct

30661 gtggaaagaa gtttggcgcg cgtgggcatg gagaggacga ggcagtgtca gcggtgcttt 

30721 gagaggtatt gttgggatgg gacacttgat gagagggatc ctggggtgtt ggatccgacg

30781 ttagttttgg atccgggggt gaagtttggg ttgtggaatg ctacgaatcc ttattgatgc 

30841 tggttttgga gttacagggg gaggctggtc tgggaaggta tagtacttag gctttgtgga 

30901 tgaaagaaat atgtgatatt tggcaagttc accctataga cccatctgat ctttttttgc 

30961 tacgaggcgg aggaccatgt cccaacataa cttcctaagt ggtaaaatga attctggatc 

31021 ttcctttcgg acccaatatc cttctctata gacgcagcac aactttccca acctgcttac 

31081 cctgccaaac atactcaaca gcctcctcag cttgctcgaa cggaaacacc ttatcaataa 

31141 ggtcatcaag tagcgtctgc gaagccgaga gcgcctcaca aaagtcctcc atatcaagct 

31201 tcgagcccgc attaatgccc ctttccaggt cagtcaagtc cgcattccac tattcggcat 

31261 cataaaactc acctaaggtt gatcctccgg tcaataagag tcggaaccag ttcgctcagc 

31321 tcagatggat cctgctttcc aagatatccc acctgactga ccacccctcc acggcgagtg 

31381 cagcgcatac tcttgaccag ggagcccgtc cccccgttct ccaccaccaa gtccacgcca

31441 acaccaccag taagcttcaa cacctcttca tcccactctg gtgtgttttt atagttaacc 

31501 gtcaagatag cagaaccagc atacttctcc ttaatagcct gaagcttcgc atcactcgac 

31561 gaagtcagaa ccacacgcag accagccgca agcgccaact tcagcgcaaa catactgacg 

31621 cccccagtcc ctaccacatt catcagtatt cctttccctt cacaccccaa gaacaccaat 

31681 catagagaca aaaacactca ccctgcacca aaaccgtctg cccaattctc atccccttca 

31741 acgccgacca cgccgtaacc ccggcacaag ggatcatcga cgcctgcacc caatccaaat 

31801 aagcagggag tttacacagc acactctcat catagacgat ataatccgcc ataaccccat 

31861 cttcgtcggc tgcgagccac gaccgcttcg tcgcacgctc tgttatctgc tcggtgtctg 

31921 taattggcgc cgcccgatcc ccaacagcca gagtcttcac tttctctcct acggctatta 

31981 cttctccggc ggcgtcgttg cccaggatcc cgtgcggcag cacaggccat gggttcccgc 

32041 cattggcgat gttggcgtca cgatagttga gggcgacggc gtgcacgcgg atcagaactg 

32101 ccgtggggga tagtgggagg gggagtggct ctgttacgat gttcactttg ggggtgtggg 

32161 gagtgtagtc gtctgttcgg cggacggcgc gacggtgggt gatttgtgcg tgagacatct 

32221 ttttctttgg tggatgtaat gatggatggt gtatctacgg agggtgtaat gattgtttat 



32281 acgaacttat atgttcttct tttgcaatgt accccgcgtt ctggatgctt ttggttgtga

32341 ttcggatggg acagccgagc cgatgggagc ttatgaatca atccatgcat gaggtttccg 

32401 agatccgaca tccgagacat taatcgggtg atgggatata ttatgctcgg gttctgtatg 

32461 aggctaaatg tatggatatg aaattatata ctagaatcac attgtctcaa aagtatatgt 

32521 cctcggagac tagggaatgt gctttagccg tcacgacagg ttcacccggg tagcctgcca 

32581 tcagcccgcg ataactattc gtaaacagac ccagcaccgc cgatttaccc acacagcatc 

32641 atgctggcct tgacgtcaat tctaatataa tcttatcaca gcactatatt atgataacta 

32701 tctcctcgga gtccatcatt ctatccggtt tctggaagtg ccaggaatcg tcatatccac 

32761 gttaactgat cagccgagat gatataccaa tgatgtaata gtggccatgg acagacactc 

32821 cgttctgtat ataaacacca cctctgccaa gcaaatattc tctcatcaag ataatattca 

32881 cataagaaac ataatcttca ttgattctag agctcataac ccacacaact cttaataccc 

32941 tccacctgtt acaaccactc aaccgctttt ccatcatggc cgaccaaccc cccatgggcc

33001 gcgactttat cgatgcaaag aacgtcgacc ccgtcgacca caatgaccag accgatcccg 

33061 aagactctgc tagccagcct cgtttcccgg gccccggttc gagcacgacc gctccgtcta

33121 cctcgaagcc gcattcgaac aagttgttga acaagctgga tccgcgatac gatagcgatg 

33181 ttttggagtc ggagaagcag gagaacgaga aggcgcagcg gaggggatgg gattgagttt 

33241 gctttgatga tggattgtca agataggaag agaggatgat ggcttatgat catgtttaca 

33301 tgtttttgaa tttctgaggt tattatactc gtacttaaat tatgatacta tgttcatata 

33361 ttgtacaata aacgaagaaa taataagacc tcaaggtgat tagtatggaa gggtagaatc 

33421 acgccccgtt gataatctcg ttggcaatct tctccgcaag catatctatt ccgcgtcagt 

33481 ctctgatcca catttaacca tcaggtagag ggtaaactta caaacagtag actgaggatg 

33541 tcccggcggc aagaacggga aggcagatgc atccacgact ctcacacctt ggactccata 

33601 cacccgagcc tgactatcca ccaccgccat ctcatcatcg gacgtgccca tcttgcacgt 

33661 gcaagcagca tgccacagcg tcatgacatt atccttgatc cactggagga tctgctcgtc 

33721 cgactgtacc tccagaccag ggttgtactc attgccaatg atgactggcg ccatggcctt

33781 gctctggaat gccgcgcgaa cacgcttaaa catggcgatg gcgacctcct ggtcggattg 

33841 agtggcgagc cagttggggt tgatgacggg gaggtcgtcg gtgtcagctg aacggagtgt 

33901 gatgttgccg cgagaggtcg gggtgatgag caccgccagc atagaggcgt actggtagcc 

33961 gtcttcgggc tggttgatga gcaggttgga cacgttgccc atgtatccgg caccggagat 

34021 gtactgtaca atattagcgt tcctgtagta tgaagcgcac atacaaggta tacacatacc 

34081 tccgcttcag gccagtcact gggaaaggta gccagttgct tggtggtctg gctgctgaat

34141 gcactacgca gagccgccgg gatcttctcc caggccaggt aatccgaaat cgggttggtt

34201 agagggccgt tgccgaagcc gaccatgttg atgaactggc cgacgatacc tagaaagtcg

34261 ttggcgatcg cggtgaaggt ctgcacgttg acgcgatacg aaggcgcaaa gaacggatga 

34321 tcccacatgt tctgacccac gcccggacgg ttcgccacga cctgaatacc gtgttcctcg 

34381 agctggtctt ggggaccaac tccagacacc atgagcagct ggggagactg gaaggcaccc 

34441 gcggaaacga taacttcgcg gttagcttta atggtgtaga tgttgcccgc ggagcccttg 

34501 acttgcacgc cggtcgcctg tttctggttg ttgaagataa tcttcttggc gagggtgttt 

34561 gcgtacgtgg tcagtgtcga tggtttgttc gccaggaatg actcctcaga actactccgg 

34621 acctcgccgt tggggtcgat ggttgatgcg cagtactggc cgcccatcag tgagcccagg 

34681 ttgaagtcct gcgtctcgtt aatgccaatg gctttcatgc cgaggtccat ccacgtcgag 

34741 aagggcatcg cgtagttcgc gtacgacact ttcagcggac caccagtgga ttcatatgca 

34801 ctgggatcat agccagcagt ggcattcttg gccctgattt gagtgttggg gggagtgaat 

34861 ttcacgctgt tcttatagaa gggcagtgtc tggtcgaagg tgtagctgct atcgttgaca 

34921 gccgtggccc attgctccat ggactcgatg gttggcctgg cagtccgtta gaggactgta 

34981 tactgggttg tttataatga aaggagcgac aaaccgttga taaatcatga agttcattgc 

35041 agagctgttc aactgcgtta gctttgtccc ttctgaacga tatgcaccac ttacgatcca 

35101 ccccagcatt tcccccgagc atagtgaatg gaccggccat tcgcaccggg gacattctct 

35161 gcaacaaatc cccagtcgtg ggcagcagct gtttctggat ccgacccaac tggaagcacg 

35221 tccgcagcag gcacctcggc caccgactcg agttcatatg aacctccagc ctcaatcact 

35281 gcgactttgt aattcttctg cgccagtcga gttgcgatag tgtttccagc cgtaccgcca 

35341 cctacgacca cataatcata ttccgaatca ccaaacaggc cacgctcgtc gtagttgtgc 

35401 gaaagagtgt caggattggc aatggtcccg gtgaccatgc actgtaaagc gagtgacaag 

35461 agaacgtgac ggatcatctt agaagattag tatccaaaat tagagcgctg tcaagggact 

35521 tacaatgagt tacacaaagg agcgactggg atagacacgt aaagaacgac gcgcttggcg 

35581 gggaatgtgt acattccgtc gacatcaggg gttaagtatg ctgcatgatt ccccttcagt 

35641 ccgaccgagc tgaggtgcta cccatcattc gaactaggtt tcacctgcct gacccctgtc 

35701 cgagtgtatg tttacaatca cgattaagca cacggacata tggttcctgg tgccaaaatg 

35761 gggtcaggat gaaacggcca tccgggcggg tggagatctc ccggtcggag atgcagttca 

35821 atcaggaatg gccgcctcat taatcgggcg acgggcgcaa atttgggtcg cccacatggc 

35881 tgctccagca ataaatagta tagctggtcg cttggtcaaa acctaatgtt tgttgctcct 

35941 gagttctggt ctggttatat tgggtctact gcgttgtttt gttttccttc cggtcggatg 



36001 atgtgttagg gttcaacggt agtccgttat aacagctttc ttgcattgca ttatcgtctg 

36061 ttccgaatga ttatcttgag atgaggatcg tccgcaagac catgattgag atggtattct 

36121 ctacttctgt tcttgttcga ataagtggtc gagggcacca tttccttcac taaaatatca 

36181 ggcgtgtagg ccaatcatca cccctgtcta gagcgaattc ccatattgct attgtgatga 

36241 caacccagag ccacatacaa tgacctttag gattatgtta aatggtaata gatatagtat 

36301 aagccttgca aattaaggtg taaaaacagt gggatcagtc atgctatcga tgcgttggtg 

36361 gcgcctgcgc tttgtttatg cataaatcag cgtgttatag gcccacagtt cagccattag 

36421 tacaccataa tgcttctaga tggacgagaa tttaccaagt atcgacacga ttccatacat 

36481 ttccctaaat ttcgagattt gcccttgcgg actacttttt accaggggca tgctcaggtg 

36541 ccttgccagt tgccaatccc gaggcggctg cccagccttc ccaatgcggt aatggtcgcg 

36601 agccactctg ctgcaatttg aagacctgtc acatctcccc aacaccctca ttgactggcc 

36661 tatcttcaaa cctagtggcc atatcttccg atgcgcagcc gggatgtagt ttcctgcaag 

36721 gatctcatca aacgcactga tgttctcacg agcaagcaga gacaccatta tccagaatga 

36781 ttccctgatg actacgacag attatagaag accccggcag gctgacagac tgacgaaata 

36841 acgtcaagcc ccaagaggag ccatgtgtaa cgatgagata agatgacata caattaggag 

36901 gcaccatctc atcagccccg tggcaactga ccaagtggag agagagggca tcacatatcg 

36961 atcaatggcc gcaatcaggc ttcacagccg catgctactt catcccaaca ttccagagac 

37021 aaatccgagg cacactttat tcccaacaag aagcgacctt catgtgtcaa ggatctccag 

37081 ttgagtcaag cgctgagccg ttacgccaac caacgacggg ccttcttttc gagggatctg 

37141 caaagccgcg gcgatttccc ctcgttaagc ggcttaacga acccccaaac ttaacccgtt 

37201 cagaacatct gatgcccaca tgaaccactc cccggattct caatcacatg cgatcaacat 

37261 gaggaacttg cctttgcgtt cggagatggc ctcccgctgc agtgtgaaaa tcccggccct 

37321 ccagccctat atagccatcc accccctcct caccccacag ggatatcctc gcaaatgagt 

37381 tgagcaccac cctacctctc ccatcatgag cggtacacac cctaatccca aggattcgat

37441 gaagtccacc tggcgccggc tggatcgcgc cgagtggacc atctatcact ggttctacga 

37501 gatcctcggt gtgcaccccg aacaccttga caaggaggtc cccgtgcacc aaaagaccga 

37561 gaagatcccc tacatgagaa catggtccca gcatgtttgg atcctctacc acgccttcat 

37621 ccccctcgcc gtccaccacg tctacgtctc gtacacaggc cagaatttta cccccgtggg 

37681 cgccttcttc ttctactcca tcgccttcaa gctcatcgcc atccaccaac tccaagtcat 

37741 gcgccgcatg ggccacgttc tcggcttcct agacggcgac cagcacggcc gtgacggcgt 

37801 cccggacgtc ggggtagcca aggtcgtccg gtccctgata tcgacatcaa ccttccgacc 

37861 catcatgacc gtcttcctca gttaccgagt cagtcaagca ccggcacaga tgagctgggg

37921 atggctaccg ctcgagatcg ggctatacgg aatcatcctc gacttctggt tctactggta

37981 ccatcgactg atgcacgacg tgggcagtct ctggaaatat caccggacgc atcatcttac

38041 caaacacccc aacccgcttc tcaccctgta tgcggacacg gagcaggagt tcttcgatat 

38101 cgcaggcatc ccgctgatga cgtacttcag tatgcggctg atggggatgc cgatggggtt 

38161 ctatgagtgg tggatctgtc atcagtatgt ggtgttcacg gagttggcgg gccatagtgg 

38221 attgaggatg catgcgtcgc cgccgtcaac gctggattgg ttgttgaggt tgattggggc 

38281 ggagttggtg attgaggatc atgacttgca tcatcggaag gggtggaaga agagtcataa 

38341 ttatgggaag cagactaggc tgtgggataa gatctttgga acgtgccatg agcgaattga 

38401 gagtcgggag ggaaatgtgg attatgataa tacggttagg atgccgatct tttagtgtgt 

38461 gggggagtag aagggaggat tgttggggat gaaggttgta tttatggatg acgggaaagt

38521 ttggtgattt attgtccatt ttgcgggagt tgtatataat gatgatgtct atgggatgga 

38581 gaagttatga atataatgcc ttgttttgtt taagatacca tccaggctcg acatggggtg 

38641 caggaactag atagatagtc gatagaaaat caggggcact gccccgcatg cctccgcaag 

38701 cgaagaccac caaaatagcc gcgcagtcat cacatcccag tgatcagatt catcaacatt 

38761 ccaagattcc atcagatcac atcacctgat cagcatactg cacaatgaca tcccagcgaa 

38821 ttcacctcat cggactagga agcatcggga ctctgatcgc tcactcgctt cgctccctcc 

38881 ccgcaccccc gcccataacg attttactcc atcgcgaatc agcataccaa gactttctct 

38941 ctggggggag caagctgcgt ctccaaattg gagaagacgg gccaacgcaa gagaccaccg 

39001 ggtacgacgc cgaactccta cagggggaat cacccatcca gcacctaatt gtagcggtca 

39061 aggcgagcgc aaccatcccc gctctcagat cgatccagca ccggctcgga cggcattccg 

39121 tgatctgtct cttccaaaat ggcctagggc aaatcgaaga gctcaatgag cgcctattca 

39181 cggacccgct tacacgaccg acctacatgt ttggcattat gcgccatggg gtctacataa 

39241 aagcgccgtt tgaagccgtg ctggcggggc tgaccggatc atgcgctctg gggatcgtca 

39301 acaatgacca acaggcatcc tcattatctg cttcttcgac agcatccgcc caatctcgat 

39361 ttctcattaa ccggctggtc gaagcaccga tcgtccgaag ctccgaactg ccatggatgg 

39421 agctccacca ggcccagctg ctgaaactcg cgaccaactg cgtggtcaat ccactgacgg 

39481 cattgctgga cgtgcggaac ggctcgctgt tagctaactc cgagctgcag gagatgcgac 

39541 gtcaacttct ccaggagatc tcgctggtgt tccgtcgtct tccggagttt cagggtctgc 

39601 ccgatgtgga ggaacaattc tcggtggcca gactcgagga gcagttgatc gggaacattg 

39661 agaagacagc cagtaattcc agcagtatgc gcgaggatgt acgagcaggt cgagcgacag 



39721 agattgcata tatcaatgga tggatagtgc gacggggcca gcagttgggg ctggactgtc 

39781 ctacgaatcg gtgcctgacc cagctcattc tggctaagag tagccaaggg cgctaagccg 

39841 gcgagaattt ggtgtgggaa gggggcactg ggtccaaatc aaacaagcat aagcccagca 

39901 ggctgaatgg cccacagtcc gttttcagtc cgcggtatca gggttagacc agcaataaac

39961 cagcaatcgc tagatagcta ctactctgct tctcatgagc atactagtat gccacagatc 

40021 cagatatgtc ttggataaat gatagtcagg ggtcattgat ggatcgtttg tccaaattga 

40081 gcagtgtgtc cttcagcgtc aggcacctgg gggccattgg gcatggcgtc cactactaga 

40141 agcggatcga tagtaccaga ctggtctcgc gggcggatcg aaccaggcga aaattctatg 

40201 caaatcctgc aaaaagattt gcaggcctga ctaatgactc gtgcgcatga cgtcagggag 

40261 gcggagtctg gggcttggcg ttcgaggggc tctgcgtggt gtgcctcagg cagagttcac 

40321 tctacccctc agaactctgg caaacatcag gaatatccac gagatatgcc atctgctcat 

40381 tcaattgcta ccaggaaagt catctacacc ccgttcaagc agacttccgc ctgggatctc 

40441 ctcctggcaa gcccccgatg atcattcaat cggcaatcgc aatgaggaat ggcagccgtt 

40501 cgcggtgggt ctcccgagtg caggacggat aaatcggatg ctggcctagg gtgccacacc 

40561 agtggaggca agcaagctgt tttcgtttat tattcaggct agacccccat cgctcagcct 

40621 ttgtctgtcc ctgttggtcc ctttgcgtgc aagcacatcc tgttatccct ggcttcaggg 

40681 cccgtcgtcc agatcagacc agatcctgct ccctgctggc tgccaatatg tctcccatcg 

40741 acgagcaact agagatggcg gaggtcgatg ctcggcccca tactaatggc gcgagcaagg 

40801 aggaaggcct cgagaggcca tcatcctcag gtaccactgg ctcgaccggt tacagtagct 

40861 cgcttgaata caatccccag ctgcacaccc ttcattcgcg gaacaccgaa ctcgatctag 

40921 agcgtcatcg cacggccact tcccatgccc taagcaggat cgaaacccag cggcttcagc 

40981 atcagttgac cgtgggtgaa agtgtcaagt ctcgcgcctc ccgcacgcct ttgcctcgta 

41041 tgggcgccaa caaggattac ccaccacctc ttcccgatcg cgaggactac gtcgtcgagt 

41101 ttgacggtcc agaagatcct ttataccctc agaactggga attggggacg aagtatgcct 

41161 ttccaaccct gtcgcgcatg aaagacatgt tggtcgaccg aatactgact tatgttctcg 

41221 cctaggattt acattagcgc catcctcgcc ttcaccagta tctgcagtac cttcgattcc 

41281 gctatcttta gttcctctac caccaacgtt gcgcgggtat tcggtgtagg tattgaagtc 

41341 gccaccctgt cgagttcgtt gtatatctgc ggttatgcgt ccggtcccct ggtgtgggct 

41401 cccctttccg agctgaaggg acggaagcca cccattgtgg tggctatgtt gggattcggt 

41461 atcttcaaca ccgctgtcgc cgtcgccaag gacctgcaga cgatcatgat ctgtcgcttc 

41521 ttctgtggtg tctttggcag ttgtccgctg gcggttgtgg ctgccgtttt ctccgatatc 

41581 tataacaacc gcactcgtgg tgttgccatt gccatgtttt ccagcacagt gttcttgggt 

41641 ccgcttctgg ctccgtttat cggtggtttc attaatacgt cgtacctggg ttggcgctgg

41701 accgcctaca tccccgcgtt catgggctat gccgccttcg tgttgaacat gttcttcctg 

41761 aaggaatcct atccgccgat catcctggtc tcgaaggctt cagaactgcg ccgtcgcacc 

41821 aagaactggg gcatccacgc caaacaggag gaaatcgagg ttgacttgcg ggagttgctg 

41881 gtgaataact tttcccgtcc ccttcggttg ctctttggcg agccattgat tctggcggtg 

41941 acgatttacc tcagtttcat ctatggcctg ttgtactgct tcttgacagc ctacacactc 

42001 gtcttccagg gcgtgtacgg tatgtctgct ggagttggcg gattgacgtt gtttggcatg 

42061 gtggtcggtt tgttcattgc ggcctcttat atcattttcg ccagtcgcgg gtacaacaag 

42121 aagttggaag cgaatggggg cattcccgtc cctgaatggc gtctgccgcc tgtgattatc 

42181 ggtggtgcgc tcttcgctgg aggactattc tggttcggtt ggaccggatt cacgaagagt 

42241 atccattgga ttgtacccac actgagtgga ttgttcactg gacttggtct gttgatcatc 

42301 ttcatccagt tgtttaacta tctgatcgac acctacctta tgttgtaagt tgccttgcct 

42361 tactcgttgt tagtggaccc tgttctaatc atcattccag tgccgcatcg gctatcgcag 

42421 cgaacacctt ctgtcgttcc ctagtggcag ccagcttccc cctcttctca cgtcaaatgt 

42481 ttaacaacat gggcatccag tgggctgcca cgctgctggg ctgtgtggcg tctatcctgg 

42541 tgcccatccc ggttgctttc tatttctatg gtaaaaacct gaggaagcgg agcaagtttg 

42601 cgccgtttta tgacaaggtt gctgagtctc atgcggagga tccagtggag aacggagtcg 

42661 cggagcacta aacgcttcct taataaacaa aagttttgca tatagctatc tgggcggttg 

42721 cattttttcc ataccctgta cattattaat gttaatgtgg ataatcagaa gacaagtaat

42781 gatcagacgc gcgttccatg tttcctagta gtccagtcct gtgttgttcg gtgttgagat 

42841 gttgttggcc gcagatgcta caggacccag ttgcttgctt attcctttac cgccatgcct 

42901 gcaagtctca cttctgtagg taataaaatc agcggattta tgtaatcaag gtgctgtagt 

42961 gtagtggtca tcatgaaacg ttgtggtttt ccgcagcgcc gtttcgacct gggttcgatt 

43021 cccagcagcg ccaaaactgg cgactggctt gattacttct ttttacattt ttttgtttag 

43081 attagcgaca ctttcgtgct ctattccccc tttcctttgc ttttatactc attgttccca 

43141 ttgtacgtgt tacactgggt gttcttgctc aatcgtacaa tatatgccag ctggtcttca 

43201 tctagccatt tgcttaatac aaggacgacg acaagcgaga ttattttcac ttcgcatact 

43261 tatagtcgca gctttgtctt gctattgcga gagtccttgc tgcctaataa taaaccagac 

43321 cctgaataca cacacacaaa gctaaataaa cttccagggc ctcctccgca caggcatatt 

43381 cacgtgcacg gctggtgcct ggaggcccac tcaagctccg ccaatcaaat cacaactggt 



43441 tcggcaattc ccccagagct caggcagcag cagccacggg agcctcaggc tggtggattg 

43501 cacctcccag tagatctccg cctcacgctc acctctatat aatgatgttt gtcccatatt 

43561 tctggtcctt cttctataag catcatataa caagagcctt ccatcattcg tgactactct 

43621 cccacgcctc taattatcct gactgaaatc tgacgacata attgctcttg tccgctactc 

43681 cgctactcac actacctatc atctcaattc tacgggaact tccctccact cactcccata 

43741 gtctaatact tcagcacact cacaccccac atttcactca ccatgaccgg ccaagctttc 

43801 aagggcttgg tcatagccgt tgctgggtcc ttttccggtg gctacaaaca aggtgagcat 

43861 gtctgacgtt actgccagaa gccacatact gattttactc acttcaaata gcaaatctga 

43921 agaccatcat ccagcgtcat gggggaactt tcagctcagc agtgacggaa gattgcactc 

43981 atctagtgac cacccaaagg gaagtggaca acaagagcgt caaatgtgag taccaagcag 

44041 agccctgtct acaccttctg ctgacaattc ttagacacgc aggctcgcaa ggtctacacc 

44101 tgcaacatcg tctccttaga ctggcttgtt gagtcagaca gtgcaggaaa gaagttggat 

44161 gagaagaaat tcctgatggg ctccgacatc aagaaagacg atgagccgga aagtccaaag 

44221 aagcgcaccc tagaacaggc tctgggaatt aacgaggatg gcactacgaa gaagctcaag 

44281 gacgcacaga cggtcggcac caaacaaatc aatgttcctg tcgatgatac atgtcctctt 

44341 cgatgtaaga gacaccgaat ttggatagcc taaagggaaa gctaacgtga atagtgacct 

44401 tcaccgtgta cattgatccc acaggcctca tctgggatgc gacactcaat cagacgagtg 

44461 cgaccaacaa taacaacaaa ttctaccgca ttcagcttct ccacagaaac aacgaattca 

44521 ggacttggac ccactggggt cgggtaggcg aacatggaca acacgctttg ctgggtggtg 

44581 gcggcttaga tgaggcagag tatgagttca agaaaaagtt caaggacaag tccggcttga 

44641 cctgggaaaa tcgcttggat ccaccaaaga agggtaaata caccttcatt gaaaagaact 

44701 atgaagagga cacggaagac gaagacgagg acgaagacaa agtcgtcgcg aagaaaccca 

44761 ccaagcccaa ggcagaggaa gtgaaatgca ccttatcagc acccgtccag gacctcgtgt 

44821 cattcatctt caacaaggat ttttttcagt ccaccatggc ttccatgtcc tacgacgctc 

44881 agaaattacc cttggggaag ctgagcaagc gtactctcca gaacggtttt caggcactga 

44941 aggatctgtc agagcttatt gctaatcctg ccctggcaag tacaaagtac gatactagct 

45001 ttactgctgc agtggagcat ctcagcaacc tatactttac cgtgattccg catgcattcg 

45061 gacgcaacag acccccggtg ttgaacaacg acaatctact caagagagag atcgaactgc 

45121 tggaggcatt gaccgacatg gaagttgcga acagtatcat gaaagatgcc agaaacaccg 

45181 atacagtcca ccctctggac cgtcagttcc agggcttgaa tatgcaagag atgacgccat 

45241 gtacgtctgg tccctgtagc cttgccccgt tgactagtac taactctaag cagtggaaca 

45301 cacatctacc gaattcatcg agctggccaa ctacctcaac cagtctcgcg gtcacaccca

45361 cggcgttcaa tacaaggttc gtatctgatg caccattttt aatgaaccaa gttaaccgta 

45421 acaggtcata aacatcttcc gcatcgagcg ccaaggcgag aaagaccggt tccaatcatc 

45481 catgtactcc aatatccaaa actcctgtcg ccgtctcctg tggcacggtt ctcgcagcac 

45541 caattttggt ggtatcttaa gccagggtct acgtattgca ccgcccgagg cccccgtcag 

45601 cgggtacatg tttggcaaag gtgtctactt tgcggatatg tcgaccaagt cagcgggtta 

45661 ctgtttctct tgggggagtg gaaacagggg tctgctcctt ctttgtgacg ttgaggtcgg 

45721 caatcccatg tatgagcggg acactgcaag cttcaacgcg ggccaagaag ctaaagcgga 

45781 ggccaaaatt gccactcttg gtcgtggtcg ctcgatccct ggtggatgga aggacgctgg 

45841 ctgtgtcaat gaagatctga agggggtgtt gatgcccgat gtgcggatgc cgaccactaa 

45901 ttctaactcg agggggctga tgtacaatga gtacattgtg tacgatgtgg ctcagatcca 

45961 gcagaagtat ctgttccatg tcgacatgcg gtagactgct ctttgttctg gattttggac 

46021 tcacggtatg gcgcatatat cagcttagcg gcggcatttg gggcgggtta tggcgttcag 

46081 cgaattctct ggatttaatt tatgatttac aggcccggaa ttaggggcta tgttttagct 

46141 gacgcacatg catagctagc atatcacatg gcaagctcga tcgggtataa taatatgaat 

46201 ttgattcttc ttagctgaca agtatgaatc atctcacgac atcggagcaa cagaggcagg 

46261 gtcaagtgtg tctaggatcc atctcaatat cgaagatcct tcaggaggaa gaggagaaag 

46321 atagggcacg taggcagcag cagcaaagga tctggagggt gtctggctac agtaaatatc 

46381 tccacaaatt ataatccccg ttacaagtta ggtatgcagt ccagccaaca cacgatcgag 

46441 tttgttgact agggcccacc gacgatcctc ctcatccata gtactgtctc caccgatgat 

46501 cctctccccc ctatgaagac ttagaaggct gcttaaccgg ccgagatagg cattcgtgag 

46561 aggaaaacat cctggaaagc acaagcaaat gacagctaga ttagagaatt taacaagaag 

46621 aaaagcacgc atgcggttca gtgaggggtg aatggggata acacatagaa ttgcaaagga 

46681 atatattgaa gaatcagtct tattggatgg cctggtttta catgcatctg ccccaaagcc 

46741 gattccagcc tgcatgagaa ggattcgggc gcttataaat gagccggggt caccgagcat 

46801 ttaagaacta ctactctcca tcccctcacc cacatccctt accgcaaccc cgcaaagcga 

46861 atctcatcct ctctcattat gctggtctac cttaccaact tcttcatagc atggctcgcc 

46921 tacttttggc agccgaagaa gcaaccctgc ggtagtttaa ccctgccaca tatccccggg 

46981 gttgacatca tatccatgac gcgccaagag gtccataact acagcgtagc aacctccccg 

47041 ccatacaatc tcgcagacgt acacaatctt gacttctgca acgtatccgt cactctgact 

47101 caccctggtg aagatgacac cgtctccgtc tccgtctggc tccccctaaa caactggaac 



47161 ggccgtttcc aggctactgg tggcggcggc ctcgcagccg gtatcattgg cttggcccca 

47221 gctcacccag tcagccaggg ctacgcaaca gctgctaccg atggcggact aaccttgaat 

47281 ggaaccagta atccccagac gggggcctgg attctgcgcc ccgacggttc catcaacact 

47341 gctctgctac tgaacttcgc ccatcgctcg atctatgaca tgactatcat cgggaagact

47401 ctcaccgagc gtttttacgg atccccgcca agatactcct actggagcgg ctgctctacc 

47461 ggcggcaggc agggatactt tgctgcagca aagtatccga atctcttcga tggagttcta 

47521 gccggtgcac cggccctcaa tttcccacgt ttgatcgggt atatgttctg gccgccggtg 

47581 catatgttcc attccgcagc gccgccacag tgcgtctttg acaccttttg gaaggcaatc 

47641 atagacgaat gcgatccact tgatggagcc acggacgggt tgatatcgga ttacaatcca 

47701 cagagctgcc cattcaagcc cgaaacactt gttgggcaca cggtaacctg tccggagatg 

47761 ggctcagatt ccccagtgac catcacagcc cagcacgcca ccctcgtcaa gcaaatcctg 

47821 cagggccctg atctacaaga tcaccccgac ctctggacag gccttccacc gggcgcctca 

47881 ttccgcggaa cggcaaacac tcaagtaatc aacggatcca tcgtcaggcc cgtccccttc 

47941 ttccccatca tcggctggat taagaacttc gtttatcggg acccggacta caacgtattc 

48001 gacatgacct tcgacgactt caacaccgcc taccgcctca cgctggacgg ttacaatggc 

48061 attctcggat cggatgatct gaatctctct gagtttcgtc gtgcaggggg gaagctattg 

48121 acctggcacg ggttggcgga tgagttaatt ccagcgtcat ggacgaacgc attctgggag 

48181 gggattgata agacggacgg tgcggatgtg gatgagttct atcgggtttt ccttgcccct 

48241 ggcctgggtc actgctcggc tgggcatgga ccgaaacccg ttgatctgtt gggcgcattg 

48301 gtgcgctggg tggaggaagg gattgcgcct gataggttga gtgcggcggc tgtcaaggcg 

48361 gatgggaagg aggtgacgag ggagttgtgt cggtatccgg cgacgctggt gtacgatggg 

48421 catggggatg tgaatgatgc ggggagtttt tcgtgtaggc ctcgggaggg gtgaggctta 

48481 gggagttata tagtatagat atatactacg attgtagtgg tatttgtggg ggcttagaag 

48541 tgggacttca gtactagctt aagactgatt gagttgggat ctaacgggat agaagatgct 

48601 tcggtatttg atgctatgag gcgtttaacg aactggctaa tttcgggctt caatttggtt 

48661 aacattgaat gggtttacaa tagtgtgctg ggacacatac tgatacggat tactttgacc 

48721 acagagctgc ggcactcaag caagaattct cagtaactgg gggagtatga aggagtagca 

48781 tctctgggta gtgataggat gaagcctcga tcgtagagac aatctcggga ggagtggata 

48841 cagcaatggc aaaagacaac aacaaagaaa aaaaaaaaac acaaaaaatg gagctgcggg 

48901 gaatcgaacc ccggacctct cccatgctaa gggaatatta taccactaaa ccacagcccc 

48961 atttgatgaa agacaggggt ctacttttgc tctataactt agctatacta gcaaagcagt 

49021 ggtctgaagt gataggtcgt catggtgtta gaaatccaca tgggggacag ttggtggtca 

49081 tagattcctt agtataatat gcctgattat atggtttctt agacagttgt tgactgggat 

49141 gtttggatgt ggacgatgtc gaatggcata gttgttcagg taggcaactc atccagcgga 

49201 aatatattcc tcgtacccgt gctggcttcc tcgcgaggac caaggtggaa aaaacaacat 

49261 tcccactaga agcacactat ccaccacaga tctacagtcc aatgccctct cagagtaccg 

49321 aagtcagttg tacaagcacc agcgaccgat ttagtcatga taatgatgcc tgcccctcga 

49381 ccatcgggga gagtgaagag agccggcgac ataaccaacc tgtgattcag taccaacaga 

49441 tatgatacga agatacattg accaatactc actgctatcc aagtgcagtg cactcgagca 

49501 gttctcaatt acatcacgaa ccagcgagaa caccgacctg cccagccaga gatcctcacc 

49561 actgcgggga agcagggtaa gcatcagcga cagacaatcc tgcgacaaga ctcagctcag 

49621 ccaccaagaa acgcccgata atcacaaggt tccacacctg gcttgcagaa gcgaccgtag 

49681 caagatcacg atggcgttct gcaaccttga tcggaggccg cttgtccatg ccacaccaac 

49741 cacaacgcat acccgagagc tgagcagttg aactaatgca ccatccatgc agcatctctg 

49801 atcgattgtg gaagcgataa cgcagagacc aataaaaaat aaattaatta tgcaacatcg 

49861 gcacaacccc tcctctagct gccctacagt ataaagcaga cagagctcgc tctgctagcc 

49921 cctagcccac ttcgctcctt gatgccgtta tgctcagctc ccccatcgcc accatgctag 

49981 tcacaatgtg gacggggggc ttttattgcc tttaccagtg tatatatttt cttcgccgtt 

50041 cttacccgca agagcaagcg tccgcctcta cctcccggtc ctcgacggaa acccatcgtg 

50101 ggtaatctct gggacctgcc cgatccaagc cagcaggatt ggcagcactg gctgaagcac 

50161 aaggatcgct atggtatgac tgggtgttga actgacttgg aaatgtgtgg catcctgtct 

50221 gactgcgaaa aggcccaatc agttcactca gcatcatggg ccaaaccatc atcgtcctga 

50281 acgacgcccg actagcggta gaattacttg agtcacgatc gtcgatccat tcctcgcgtc 

50341 cgcaacaaca ctttgcagag atgtgagaca acaagaccta cataagtagg accgcacatc 

50401 aactaatcaa ttgtacaggg cgggttggaa caacgtcctc ggagcggtga agcagtccca 

50461 gcgcttccga gcaacgcgca agaacctgca tagggagatt ggctccaacg tttcggtcgc 

50521 tcgattcaat gaaatccaga cagcggaggt gggtcggttt ctactgaggg tgctggatgc 

50581 gccggataag ttgatgaagc atatacgcaa gtgggtcgaa accaccttcc tcccataccc 

50641 cttcatcgat cgcggagact aaagattaac caccagagag gcgggcgcaa tcatcctaaa 

50701 ggtagggtat ggatatacca tcgaaccgca tgatcaagac ccattggttg atctggccga 

50761 caaagcaatg atggattttt cgatggctat gcttccagcc acatgggcag tggacttttt 

50821 cctaccccgt aagccatagc ccgcagtcat acagagacaa gccagactaa ttggattatc 



50881 agtgaagtat cttccatctt ggtttccagg cacagaattt atgaagatcg cacaacgcta 

50941 tcgaaagaat gtcacggcat ttagcgatat cccatacgca ttcgtgaagg aacaaatgcg 

51001 cactggccgc tttgttccct cttttctctc caacctcctc gagagcagtg acctagagcc 

51061 tggatccgaa gaagagaaca cggtcaagtg gtcggctgga tcgctttacg ctgggggtgc 

51121 agatacggta aggagcgctt gccttgtgct gattcgatcc gacatgcata caatgccgta 

51181 ctaacaattc atcaacagac cgtatcctcc atagccagct tcttcctcgc aatggctctc 

51241 ttccccgaag tgcagcgcaa ggcgcagcag gaactcgaca cggtgattgg aacagatcga 

51301 ctgccgcagt acgcggaccg ggaacaatta ccttatatca atgcgttagt caaagagacg 

51361 ttccgatggc atccagtggt acccatgagc ctcacccaca cctcaaccgc cgatgatgta 

51421 tgcgagggtt actttatccc taaggggtca tcagttcttg ccaatatatg gtcagtcccc 

51481 ctttcaccct atgacccata cattttacca ccaattaaca ctagtgaaat catatagggc 

51541 attcacccac gacccagcag cctaccacga ccccatgacc ttcaaacccg aacgattcct 

51601 cagcccaaag ccagagcgag atcctcactt ccttgtgttc gggttcggtc gacgagtctg 

51661 tccggggcgg actctggctg atgtgaacgt gtatctcact gtcgctcagg cacttgctgt 

51721 gtttgagatc tcgaagccag tgaaaaacgg gaaagttaag gatgttcagc cggaatttct 

51781 gcccggtgtc atcagccatc ctgcgccgtt tgatgtgtct attcggccga ggagcgcgaa 

51841 gcatttggaa ttacttcggt cgttggagca aaagtatcct tgggagaaga gtaatgcgga 

51901 ggatctgaag aatatctagt attagggtag gagggtggtg gtgaagatgg tttcgtggtt 

51961 ggggaagccg gaatagtggt ggctagggaa gttgtatata gaccgagaag agattatatc 

52021 atatttatgg cccgcttaga gtaagcggat atcttactat gagatatata cattatcctt 

52081 tacatttttc aggcgtaagg tgggatagtt gtataccaat cacaccgaca agcagatgca 

52141 gacgcattca ggtcttcgca tcttcattat tcggccgtct gagcccctcc atcaaccact 

52201 cgtcaaacag aaattcggtc gcagtcggcc gatcctctgc attccattgc aaggccttcc 

52261 gaagccatcg gaagaaccct tctctattct cccctgggat atccgcacct aagctttcca 

52321 aagatatatc cggaattggc gctaagtcct tccaattacc tagagataac aattagatag 

52381 aacttgggtc atagagaatc ttggtcttac cctgttcatc ccagaagaca taagctaatc 

52441 tgcaccgctc ccggaaacta gccggaggag gtcccatgag cgctatcatc tccgcaatat 

52501 gcactcggtc atcgaagatc ccatctgcat ttctgccgtc gaacatgtgt cgaggaatta 

52561 cgatgtccca tgcctataat gaaaggttta gcatggaggt agggcggagt atatgtctcg 

52621 agagggtatc ttaccaccat cgcaacattc catatgtcca ctttgtagtt ccagttctcc 

52681 tttaagacaa cttcaggagc cctatacata ttcggcataa tgtcgtctct tttctcgaca 

52741 tcgccaaagc gggcctcgcc gaaatcactt attaggggta ggccacttcc cccggggtta 

52801 tagacggtag agagatagat ggtcctgtcc ttgagaatct ttctaggtga cggatcgttg 

52861 acttctcgtt cttcgagagt ctcgagagtc ttagtattag ggactgggag gagcaaattc 

52921 ttcaattgaa gatcttttcc tggattagct cagcagtcca gattgtacag gtgggggagt 

52981 taccagtatg tactatacga gcagcagagt gcagaaagtc cagagcgatc aatagctgcc 

53041 tgatacaagt cttcatctcg tccagagtca tagtatttcc agggatgtat ttctgcagga 

53101 gcgtatctgc actcatgccc ataggctcat gaacgagaca aacatgaggg ccgtggggcc 

53161 cggtgacagt gaaatgatcg aagagcctgc gaacgagatc tctccctggg tgctcaactt 

53221 ctacagcatt catacgattg tagatctcca gctcatgatt cttttcttgg ccagtgacat 

53281 atatcttaag cacaacatat ttggagtcac tgagatctca cttttgttag gggcagcgtg 

53341 gccatgaagg aggcctttca gacgtaccga aggtcgcgac ctaaccagac agttgatgtc 

53401 actccgtaac cgagtttggc aagcacctga tacctgttat cgagtacctc tccctgttgc 

53461 accgggtagt acttttctgg atggtaagtg gggagagtct cttcctcaat ctctatactg

53521 ggatcaataa gggcatctgt ggtgggaaat acggtcaagg aggatttcca ctgcgaagag 

53581 taacggattg tttgatatga cgcgcgtgtg atgttccgga gatgtggtgt aggggtaggc 

53641 atactatggc gacaggagag gtgtagcttg gcgtgaagca atccgagacg acgggcgcat 

53701 tgggtgatgc gtcctgccat ggcgatagct gctgttcaag ctaactatct tatctagata 

53761 gatgatgtta ttgacgggtc ggtggagcgc gagaccccat cggccatcca gaaaggggat 

53821 gggtgctgca taaatgacat aagcagtatt acttgcatat gcataatgta tatcgatgtc 

53881 ttgggctgac ccttttgaga gcctggtgct cactagagtg aatacgttac gtggagagaa 

53941 tatgtgagtg tgatcctaca ggaaaatcaa agaaagatgg aatataatgc tatattcgtc 

54001 cattcacaaa gtcaggacat gcacaaacca gacaactgag gcctgataag agccctttgc 

54061 tggctgctat acaataattg aaaaagatca cattttcctc ttcattaata cacataccga 

54121 aacctcgtgt tttctttgtc tgtcagccca gcccacttgg cttcatccac cattgaatcc 

54181 tccaccttgc cataccggcg atcccgtctc cggttctccc acacataata tgcgaacagc 

54241 acgacagaca tgacggctgc cgctgcctgc gtggccagca ccgcaatctt cgcgggacga 

54301 tattcagggg catcacgggc ctggaaggtc tggggcccga tgatgttccc gaccgagaag

54361 gagccggcga tcagtgcact gctgaaagcg cgcttggtgt gaccggcaca gttggccatg 

54421 gtccactgat acaggatggg gagggtggca acaatggcat tgaccaggta aataccgatg 

54481 aggacaccgg ctttgttgct cttagggagg aaggagagaa ggccaccgcc gataatgccg 

54541 ggaatagtgc agaaggcgtt ccaggcccag cggtgcgagg ttctgcggat accaaagccg 



54601 acgaggaggg tgaagaagat actgacaatt ccggagggca tgttgagaag ggcggagatg 

54661 gggccggagt atccaaaccc ggcgatgaga gtggcggagt aggtggttac aacaccgcta 

54721 gaaacagaga tctggcacgg ggttagcttg gaagtgaggt gtgtgatgga cagggactca 

54781 tcttacgagg atggtgatca agaccaatag ccagagttgg atgtcaagca cgccttccca 

54841 gatctgctta aggttgatgg cgctgctcca cactcccgtc tggttgacac tgacgtgctg 

54901 caggagagcc accttttctt cctcggacag ccacttggcc ttcatcgggg tgtcaggaat 

54961 gaagagcaga gtgagtacac caatcagtac cgtgaccaag ccaatgacta ggaacatgat 

55021 gcgccacccg gccagggtgg ctccatggtg aacatgctgg aacccaaagg ataccaggcc 

55081 accaatgatc tgggccactc ccaaaccatt ataccagagg gtgaagcgag gtgcctgctc 

55141 gctttttgta tagtactgac tgctgatcag catcagcgac gggccaatcg tggcttcgaa 

55201 gatacccagg aagacacggg ccgtcaggag ggtcgcatag tttttggctc ctgcggaggc

55261 tgcagcggca actccccaaa gagcgacatt tacaccgagc cacttggcgg cggggacttt 

55321 ctgcagacag tagactgatg agagaaaggc gtcagcgcac tattcatgtt tgttgaggca 

55381 aacgaacata ctattgggag cttcagcgat caaataggcc aggaagaacc acgttgcggt 

55441 gttggtgtac tcgttccccg cgaggtgaag ttccgttttc atccccatga cggcagcata 

55501 ctatgtcccc ccttgcgtca atgaggggtg tgttggcagg aggcttgcga cgacttacgt 

55561 tgagcatgac cttgtccaag aattgcagga catagcacga gaacataaag ggtatcagat 

55621 gccagtcgat cttccgacgc aatgcccgta ggtcgatctg agggctgtcg gcggccgcat 

55681 ggccctggag gtattccaca gcatcgtcgg tgcgatccac aggcagagcc tgagaaaagg 

55741 cgcccggcag cttctcctcc atgttgaatc tatagatctg tcgacaaggg accagaagca 

55801 aaagaaaaga aaagagacaa gggggaaaat tgcgggaaaa taatgaaaga acaagacaca 

55861 gtcagggaaa ggaatgccgg gatgggaggt tacatgcagg atcctgcagg caaaaaacca 

55921 cgggaagttc aaccggattg gccgcagggg tgccattggg ttccagtagt tataatatat 

55981 ttgtcaagat aactgcctgg ccacttcatc ttgcagaggc ctgcagatgt tccaatgtct 

56041 acgctagact agggttcgaa atttagtgcg cgcgatccag cccggattgt tctggtcgcc 

56101 gaaggatttt gattgcgaac ccaccaggaa tcatcccgtt cactacggta ttcttggcct 

56161 gaccgaagtt aattgctgaa ggattctgaa tccgaatcat gttccagctc tgaggaccaa 

56221 atttgcccct tgttgtaggg cttcagcatg atcctgcgaa gcccaggatc gaaaaatccc 

56281 ttcaaccagc gcacagcacg acttgcataa tcactacgca aatatccctt ctccctcatg 

56341 tctctcccct ctttctcctt gaccttctac atttactgcc agtatgggct gggtgcacca 

56401 tgcctcgcct gaagtggaag cggagtcgca atatcgcctc attctagggg tgtgcctggg 

56461 cttgaccctg ttgatgatta tcacagtctg cctgcgttta gcaattcgat tcaacgcccg 

56521 ccgattggaa gcgtccgatt ttgtgatgct ggtgactatg gtgggttacc ttcggtggat 

56581 agacgatccg accgctaatc ggtgccagat cttcagcatc atatacagcg cattgtgcgt 

56641 cgctcgtgag tcgaccaagc cttgagtcca gtctaagcgc tcactttcaa cagagagcag 

56701 gtacggcctc ggattaccct tgtcgctgcg ccccaaggca gaccttccgg attacacaaa 

56761 ggtatgcgct gtgcctgaat ttcagtcaag acgaagtatt tataaaaaca ccaacagctc 

56821 aattatgccg gtcgcccatt ctaccaactt ggaatcgcag gattcaaagc ttccctatgc 

56881 ctcaattatt tgcgattgat ttccggaaca tccaagaatt tctaccgaat ccttatatgg 

56941 gccgtgatag ctgtctcgac acttggccat cttgcgggaa cattagttct gatattcgac 

57001 tgccagccgg tgagagagtg aaatattttc cagtcgcagg gcaagctggc tgatatgacg 

57061 atcgaaaggt tgagcgcgcc tggaacccca acatctcagg gtcgtgtctt ccagcagggc 

57121 caacattcta tggattagcc atcttcacca ttatctgcga catcaccatc atcatccttc 

57181 ccattccgct gttgcttcaa ctgaatatca agaccgcaca gaaagcggga gtagtttgtc 

57241 tgttcctgct ggggctgttc accacgatct gctctatcct tcgcctcacg caaatccatc 

57301 gggtggcata tggcgatgga aattcgacca tgttggtgct gtggggcaca attgagttca 

57361 atgtcggggt aggtttccac gagcacgcga aataggatta tgcaccgagt cactgactaa 

57421 ttgtattaga acattgtaac ttgtgtcccc ttcctcgctc ctttgctgaa aggtgcagtg 

57481 cgcgatttcc gatcgtacag tggccggaaa gggtacgata gccggagcta tgctttgcag 

57541 acctggtcca aagacccgcg ctcgcagtta cggtcgacaa cttcggcggc gccgcaaccc 

57601 aaacgcacgc ccagcgaaga gcttatcctc gagagtgggg gcgtcgacga gggaggcatt 

57661 catatgacgg tggagctgcg cgtgtcgcta gagaagaggc caacgacaga ggtcagagag 

57721 tgatgccagt actcttgcca acgactgatg gcgccattcc ttagaactaa tctcttagac 

57781 tttcgttatg tatattaatc actcactggg tcttcctttt tcttctcgtc ctattgtgag 

57841 ccactttccg gcgatttctg aaatagtgca atgcaggaat cttattaggt gcttagcacg 

57901 acgggattaa tagtgcttac gcttctcagg atcaaatcag accaggatca tactgggtcg 

57961 atgtaacaga cacaagccaa tggagtcgag cagaacaggc caggccaacc gtccccgacg 

58021 ggatcttccc ggccatcaag cctagctcca tccaggaaga atcctgtgcg tcccgtgaaa 

58081 ccttaatggt ggtccacccg gacggccagc cccgggcggc gaccgagctg tgtcccgccc 

58141 acgccagctg gcctgttggt gagagatgga taatctggcc gcctagtcaa accagtaact 

58201 ggatattagg gtctggccgg tgtgctaagt atctgctaga gagatggcac cactatcagt 

58261 acggttcaga tccactacgg acggacagtg acaggttgac accggagaag taactgtggt 



58321 ggtagggcta gagaccggga ctaaggcctc aagcacgacg ccgtccccca cgcgattcca 

58381 gcctaacatt cttagtgctg ttaaggctct tttagggccg gccttgtttc acctcacgca 

58441 ttagaggggc acgaaacatc caatgtaact ttattgcgtc gtgattacgg tttaattatg 

58501 attaaattaa ggttggcaga ggttttcatt gcctgataga gtcctgtatc ccagctgctg 

58561 gtcgtcaacc ttaccaaccc tgttcgttgt tcttagcagc acggatacga gtctctggct 

58621 ccttgcatat ccaacgacat ggtgatggca tttctaacag tgaccctcag ttatcagaca 

58681 atgcggcatc aatcttgact ctcaccgagc attcccccct gatctaactg gaaaagataa 

58741 gcataaatta ttcagagaca aacacgagcc ttagcaccat caagctctct aagaccaacc 

58801 aactacggag tagtatagtt tcgtcggcct gaccccatcc ccaacaatgc cttacattct 

58861 catcccattg gcacattccc aatgatcatg gttccaccaa tccaattcct gcgcacctgc 

58921 cacacgtgtg accgtccgat tccagccggc accccgtccg gcccttgatt tcagcagtat 

58981 tacagcgatg cgtcttcagg taaaagggtc aggggatgaa cccgtgttcc cgtggaaccg 

59041 aacatccgtg cctcctcgct gcccacatca tggatgatct tcccgttctc atcgtcggcg 

59101 caggcatcag tggccttctt ctggcccagc atctccagaa gctcggtgtc ccttacaaga 

59161 tcttcgagcg cgatgccgcc atcgacgcac ggagcggggg ttgggggttg acgcttcatt 

59221 gggcgctccc tgcactgcgg gaactcctac cggaccacct cgtgcaacgc ctcccagagg 

59281 catatgtcaa caaggcggcg gctgctcgcg gtgacaccgg tcgcttttcg ttctttgacc 

59341 tgaaaaccgg atctgcgttg tacagcgttc cagctgcgga gcggattcgg gtcagtcgtg 

59401 ttcgattgag gcagctggta gccacggggc tcgatgttca ggtaagtctt tatccactac 

59461 cttggtgaga tggaatggaa tgagttcaaa cctttcagac cactctgacc agaattgtag 

59521 tggaacaaaa ctctccaaaa tatcgagtcc accgccgata ccgtcaccgc gcatttcgca 

59581 gacggaacct cctacacagg atgcctcttg atcggctgcg atggatcgcg atcccccacc 

59641 cgcgagatct tgtacccgga cagccatgag atgaatccac taccggtgca gatcctcggc 

59701 gcagctactc tgtatacagc agaggagatg gcaggggctg cagagattga tccattcata 

59761 tttcagggct cgcaccctga atctaacgtc tttctcttct tcagcagtat gtatatatcc 

59821 ccattctatc acaagtccca catacacacg atggctaact tactatcctc tatcttagta 

59881 ctcgataccc cgaacaactt cgtcgaaagc agcaaagaca aatatgaatg ccagatcata 

59941 ctatcatggg cagattccaa ggatatcgca gtccccagcg acaacgggga gcgaatcgcc 

60001 ctcatgaaat ccttggcctc ggactgggcg gagccattca ggaccctggt ccatcggtta 

60061 tctgaggaca cggaggcacg gtcaatccgc atcgcggact ggatgttccg accgctgcaa 

60121 aatcgctcgc accccagggt tgtgctgatg ggtgattcgg cgcatacaat gactatgtgt 

60181 atgtccatat aatgctgtgc tatatgggaa tatacatgct gacagggata atttagaccg 

60241 tggagaaggg gccaataatg ccatcgtcga tgtactggat ctgacccaac gtgtggatat 

60301 gcgttcgcta gggtgagtct atcagcggat gtactactgt ggttggagac aactaactaa 

60361 tcggtccaaa cagaaccatg tcgacacaag cgctgcgaga tgcattggac gcctacgaga 

60421 acgacgtctt caggcgcgca gagcctagtg tacttaactc gcgccaggcc tgtgtcgacg 

60481 ctcatgactt cacacgtatc ctagacgaga gccccctcgt atcagcccgc gtgctgaagg 

60541 aggataccac ggagcaatag tcttagccag agaatcatca cagtgttgcg gggaggcagc 

60601 atagataatt ccttcaatta atcacaaacc taataattct atgcatcacc agaacactgc 

60661 gtataagtac aaggtaccag acctatttca cagaccctac cccctttgca accccaacga 

60721 agaaaacaac atcgttatgc aaaataccca tgtacaacat ctcctctcag ccaacacaaa 

60781 tcaactagaa tatcaccgcc cacccaaccc ctcatgcggc gcacgaccga ccttgggttc 

60841 tccgggcttt ccacccttaa cagcctcccg atccccatca gaaacaaaca tcgagcgccc 

60901 aagcttattc tccccatcaa cagcctgctg gtctccctcg ggagagaatc ttttctcctt 

60961 gcttggcgac tgcatccgga atgtgagcaa ggcgccggag ataaccgcga acatggttag 

61021 gaaggcgaca ttccgggacc tatccatata tgtatattgg ttagaatgat gtccgtatat 

61081 gatttacttc ggtatcgagg gggaaggagt ggtgcttact taggtccgga tgacatgcca 

61141 tttccgtaac tcagacttgg gttcggcata gtgggtagga tgatgatgat gatgtgtgta 

61201 tcatttaact caaaattggt tattattgtt aggagatgat acttgttgtt caggtcctct 

61261 tatatttgaa tccgaatagt cgccacgatg acacaaccat gactgcgatg ggtgtaggtt 

61321 tggccagact tgcgtttagc tttggtgggc agcagtgtaa ggtttgggtg ggttgtggat 

61381 ggggaagatg ctcccggcgg ctgagctcgg gtgtagttta cttcatctat tgttgctatt 

61441 gtgatgaaaa tagattgagt tgtttgcagg atatggtatt tgttcagact aagttatact 

61501 ccaaattgtc acaggatcaa tgctcaagct gctatgagcg acggtgattt gaatcgaccg 

61561 gtggtattgt gttcgggttg aactggaatt gtgaatggtg tagggaccag tgaatgagaa 

61621 ttatatgagt tcttcctctt attgtcagcg agaaagagga tggttgtata tcggctacgg 

61681 ggaaaaaaaa aaagaaatga tgcaaaaaga tatgacaaga gtgggattcg aacccacgcc 

61741 cctttcgaga ctaggaatgc tccgaagaac atgcgagatc agatacacct gaccctagcg 

61801 ccttagaccg ctcggccatc ttgcctgttg cctgaaaaat gctggctcct tttgttatga 

61861 gagctcttac gttcttcttg acgtgtagca aaagtagtcg caatggcatg cttcaaaacc 

61921 atgtctgatg ctacgaggtg catttttatc actaggtaag atgtacctat tgtttgaaac 

61981 atagttttgg tctcaattac tgttattaac cgtctagtat agtgatcatt cgatcataca 



62041 actagtagta gtctgaccgg gccattgatc tgaactgata tatatatatc tacacagtgc 

62101 ggaatggggt tccctcaggt gctcgcagca aaatccacag ttcaaccagg tggatgcgct 

62161 cgtcgagaga ccatgcaatt caactgtgtc cgtcacatgc tccacggcag cgttaacagt 

62221 agatatgcac gagtccagac gatcatttag tcgcaaagat caaattgcaa cgctacgaga 

62281 aactatcgaa tatagagtaa cagtattggc tgcttgcgac cgagcgcgat ttcctacggg 

62341 acctgaaata ttctcaacaa atgcagtcag agatgactta acacgcaatc tcgggccgga 

62401 cagcattaat accgaacgga gataggctca ggatgcgccg tgaaacagtg attatatacc 

62461 cgtttaggaa actcataagg aggctgaacg cgatcgccag caattcagtg ccccacagaa 

62521 gagattggct gtgccggctg gttgatgatg acaaaaggat tacggggtat gctgcacatc 

62581 atatcataat tcgctatcag cagaaataga acataatatc tatgatcgac cgcagagaac 

62641 agactaaagg ggtaaagggg ccaggaacag gagagcttct ctctgcttgc ttctccactc 

62701 ctcaatgcag gtaccagaac agttatgaga aggtagatag ttcatggaac ccaccggtcg 

62761 ggtcatcacc gaattgttct tataaattgg acaaattttg agtgcacaca tcgaggcgag 

62821 gatcggtgat caacgtggac agcaactacg agcatagcca gaacttctta gccaaattca 

62881 gagaacagat aacgcgcata tatgcactag ccagccagaa agcgcagaca caaatgaaac 

62941 cctcctctga tagtagctag tagcaactgt acaggaagaa ctgtgacgat ggtgcaccaa 

63001 atcttgtact agtcagaggg gtatcggggt gaatgttgag tattattgag aagcaggtat 

63061 gattgctgat ctcaacgacg ttcgaactat tgagaaatcc cagctattgc tgtcagcgct 

63121 aagggggcgc ccggggagaa aagagtggcc aaggaccgcc aatctgcgcc ttttgcttgg 

63181 attgactcag acgcgtcatc ccagttctga cagtgcgaga tgtgattagc agtgccattt 

63241 caaccgcatc acacactcca gcacgattag ttcggaaacg agcacgagcc aagagggcca 

63301 aaccccctca tggcatgtta tttagtctaa cgctcgctaa tttatttgat gcattgatgg 

63361 tctgaaagca acagcccgaa ttgccaggag gcctgcctag tttgcccacg aagggctcgt 

63421 gcaccacgcg atgcactcaa cggtcgaaat aagagtatga aaggatcaaa tatattgaat 

63481 cctcccgttc caaccggata atttgctctt ggcgtatgag gggaggtgag gggaaatgag 

63541 gaggcatgag gggaaataca cacccgctga ctaactgaac ccctgagctc cctgagctct 

63601 gagcacgagg cataggatgg agctattcat agatgtcaat gagtggcgct tatgctttct 

63661 ttgcaatctt actaggctct tgtcgccatg gtgggatgag tcccaaccaa ttactcggga 

63721 ggccgtctct caaatttctg cctgggaggg acactagtgc cccagggact ataaggggtg 

63781 gtcgacgaca atcgagcaca tgtgatagga tcccccagac ctcccaaggg cttggtaagt 

63841 gattgaggcg atctttggaa taggcacggc ggggcgacag tccgaaaggg aggtaacccc 

63901 cgatgttgtc tctcgactgg ccagtctaaa aaacgccacc aggccagata cagtactagc 

63961 cacgcaggga tttcactcca ccgcttgcca gtcatttgaa caaaggcatt agaaaaaggc 

64021 tatcgagctc tggtacactt tttgtgcttt tttttccagc tctcatgacc acccccgcct 

64081 cgcgtcggtt cgggaatctc agacgagagt ctaaatctct tcgctgtgcc gcgccaattc 

64141 taaaggacac tgacacggaa cacccaatca gaatataatg aactcgatac acaagacatg 

64201 ccattagccg agtaggcgtc cagccaggtt gcagccaagg ccaacgattt cgtttgttga 

64261 gaatgtgttg gaccaccaca atgggatcta ggtggatgtc actgcgcggt tttcttttcc 

64321 atcttgcgct ccacgagtga tagggctgaa gaacactgtc actgtgaaca gactcagaac 

64381 tagcgacgac tttaagagga ctaaaaaaga gcgcaaacca gcgcacagaa aggatcatcg 

64441 tgctttccac gattctacgg catacggaaa gtcctccgta tgtagtccgg cccgcgtaat 

64501 ccgtaaccat ggatccaggg ccgtgtctga ctctgtcatc cacccccctg gagatcccct 

64561 ctccaccggc ttagattgat cctccttctg aaagaggcaa ccccaaagca gaccgtgctt 

64621 cgaaaggttc gaagcttctt tatcctcccg ccttgtaacc gtggcgcggt atgaggtgat 

64681 caccctaatc atcccctttt ctcttagcgg gactccgcac ctgatagtgg ttgcgataaa 

64741 gaacaggacc gcaactgtag atccaggagc tgaggagcta tttagaaacc atcatcgttg 

64801 gtgcctgcga tagcgcacaa caagtgccac catgagtgcc acccagaaac gtgtccggag 

64861 gcggagaatt tttacaggcg cccagtttta tcgccaccac cagccaagaa taacacggaa 

64921 acgtagcttt ccgacataat gcttacaaag tttgttgtca atgctattct tcctcactcg 

64981 tggccagata cgaggatcaa cgccaggaac caagtaaaga cacgaagcaa aaaaaaagtt 

65041 tggaaccagg gacaggcccg gggatacccg acctatgaag caaagcaaag tagcacaagt 

65101 aaaagtacca gctcggcttc tcgcgttttg cttccatggc cgaggatgaa acccggagcc 

65161 actttctgac tcgcttgccg gcttccgact ggcgttggcc gccccacgtg ggcctggcag 

65221 gaggtttact attattcgat attattcggc cttggttttg cacaagccca cacaagacag 

65281 gacaaagaac aaagaaaaca aaaaaaataa taagccacaa aggcaaaaag tacggagcga 

65341 cgtgaaccct gatcctttgc caagacagct actgagatag cttctcatcc taaggcttcg 

65401 agtggttctg atgccgagaa atcgaaacca tcactgcttg tagttgtctg atgccctcaa 

65461 acataactca cgatggtctg gtcgccccag ccggggaccc cagccagcag ccgtagtcat 

65521 tatcaaccgg ccgataggga ttttcatgcc aataacaagc ttcggtggcg cttctagggt 

65581 atccctctag cgactagaac cattgggacc gtcgcatgta ggtcgcactt tgagccgtct 

65641 ccaggcggac tattgccctc ttataaagct tggttgagac taggagaggc gccgttcgcc 

65701 gcgcaggttg agataggcag ctaaatgagc caagctcgct cccaaaactt cccaggtgga 



65761 gatggaaccc ccccagtcgt cacggaagat gataactcct ttcggcacat gcccgaaccc 

65821 gccccggtaa ttgcattgag tcgtttccgt cgaatccgtc gaatccacgg taccctcgat 

65881 cgacacttga cgcgacgttc ctaattggac ttagagcggg cgagggctcc tggccacgct 

65941 gcctgccagg accaacccca gggggttcga gggcacagcg gttatatagg ccaaaatgcc 

66001 gtcggtggaa aggaagcctg aaacaattcc acccccggtc acattttccc catcatgctg 

66061 tccctggcga tccagtcctt gctgttcgag gcagacgccc atgcggccag tccggccctg 

66121 cagcgtcgct cgtcctcgtc ggcaagcact gagggcgagg ctgtcatgag cccactgccg 

66181 agtttgcaga ccaagggctt gatcgtaatc tcggcgctgg ccaccgtctc cctcatttgt 

66241 actctttcct tgttgacttt ctttacctat cgttttatct tctggaaacg ccactacaaa 

66301 cgttacatcg gttacaatca atatgtcgtg ctcatgttca atctcattct ggctgacttt 

66361 atccaaagtc tgggctttat catcagtctg cgatggatcc agaccaactc ggtgaccgcc 

66421 tcagatcctg catgcttcct gcagggaatc tggctgcaga ttggtgatcc tatgagtggt 

66481 ctgtttgtgc tcgccatcgc cgcacatact tttatgcacg tcaccttggg ctaccagatc 

66541 tcacaccgtc tcttcgtctc catcatttgt ggcctgtggc tctttggggt tatcacgact 

66601 gtcatcccga tcgcagcaca cggtcggtac gtgtggtatc cagctgtagc ctgggtaagt 

66661 tgattctttc tctcacatta cttttttctc tttcataagc cactaacatt catctaccct 

66721 agtgctggat gacccccaag tatgagtcga tgcgtctgtg gactcactac ttctggattt 

66781 tcgcctctga attcttcacc gtcgtcctat atgcgatcat gttcattcag ctgcgcaaga 

66841 aaatcgccga gtcagccatt ctgggcgagc acaactctga gagtctgacc cgactcaagc 

66901 gcgttatttt ccacatggcc ttgtaccctg tggtgtatat ctgccttacc ttgccccttg 

66961 ccgcaggtcg aatggcttcc gccagtggtc attcccccag tgtgctctac ttctgctttg 

67021 ccggatcctt catgacgctg tgtggcttct gtgattccct catgtacacg ttgagtcgcc 

67081 gcagcgtggt gctggagccc gaggccagga tccacggcag cagcaacaaa tattccagcc 

67141 gaccgaacaa atcgtcggtc gcacaccact atggaaacag tatcgatggc aagggaccga 

67201 ccaacacgac gattgctcgc ggtcgcagcg attccaccga ggagatgatc ggcaaggacg 

67261 ggttggagct ggccccgatg ggagtggtgc tgcagcacac gacaatcgag gtcacccatg 

67321 aggcggcgta tgactcggca tcgagcaatc tgagtggcca acgcgagcga agcagcatct 

67381 accattaaac ttcctaagct gtcatgacga ttttgcaact tctatatgag attaccggcc 

67441 gaacatcgag cgccgggttt atacaatttt aatcaagtcg catttgccat cctactacag 

67501 aaccagtcaa gaactgtgtt ccctttttct ttgcatctgc gacaaacccc gtctcgccac 

67561 cactatgcga gcatcacgcg cccacaagct atcgtcgctg atcgatatgc gaccgacgcg 

67621 cgcatcgtat ctgaaggtcc ctccaccgta cttcagagga ccgaaaccct cactgggtta 

67681 gcgacgaccc agcccagcca ctgactggcg cggccaaccc acatgcgccg tgaaagggac 

67741 atccagccca caggcaacaa aatgccagtc aaaatggggg cgatttcaat ccagatcccg 

67801 atccagtcac ccagaccaga cagactttgc aacgctttga ctgcaacgct ttgacctgcg 

67861 ctggccacgc aatgaatgaa atggcgccta tcctggtaat tttccggggt taatgattct 

67921 ggcggaggac agcaacatct gccatctaca cgtcactgta catatttcct atcagtcatg 

67981 atgcaagtct aaacgaacat aaaatcaaaa agacaattac taccacaata cagttcacac 

68041 agtaaactcg gaacatagcc gacaagccaa taggaatgta ccccataatt gagttgtccc 

68101 aaccccgcac ggagacaaga tctaaggtta tctgcatctt gtcccccttg ctttgctttc 

68161 caccaccatc ttatccatct ctttttggtt tcccctgtcc gttatacaaa tatcatacat 

68221 acaatacaac atgtacatga aagagatata cacggactcc accccctcca ccctccacac 

68281 cttcatccaa caaaaccccc tgggcgtcct cactaccgcc atcccctcct ctacacaccc 

68341 gttactccaa tccacccaca ttccctgggt actcgacatt cctcctccca gcactgccga 

68401 taacagtaac acagtcaaac tccgcggcca catcgcccgc gccaaccccc aatgctctgc 

68461 catcctcgac tccctaacca ctcaacccga atccatcctc cccaccgaag tcctcatcct 

68521 cttcacctcc ccctaccaca gctacatcac cccgcacttc tacaccacca ccaagcccct 

68581 caccggcaaa gtcgccccaa catggaacta cgccgccgtg caagtctacg gccgcgcacg 

68641 gatctacaac ccccgttcag agggagagct cggcgagcag gcctcgatgt tcttggatac 

68701 acaactccgc gatctaacag cgcactgtga gggggagatc atggggtata ctaccaccgg 

68761 tactagtgat acttatactc ataacaaccc ggaggagaaa tcatgtcccc gagcatggac 

68821 cgtcgacgaa gcgccagacg catacatcaa catcctgaag aagaatatca tcgggatcga 

68881 agtgactgtg acgcgaatgg acgggaaatt taaaatgagc caggaacgcg gagaagggga 

68941 tcgggaaggg gtgctgaagg gtttggatgg gatgggaggg gatgtcgcga ggggagtggc 

69001 ggggatggtg agggagtgtt gttgtgatac tagaccatga ataaataaag aggattgtgg 

69061 aatatgaggg ggttagaggc agaattacat caacgggcaa gatggccgag cggtctaagg 

69121 cgctagggtc aggttatctg atctcgcatg ttcttcggag cattcctagt ctcgaaaggg 

69181 gcgtgggttc gaatcccact cttgtcatat ctttttgcgt ggtttttttg tgcatgggat 

69241 tgttatttct ttgagagtag atacagatac atagattata tatcattgtg tttttgatag 

69301 ataagatgtt gcttagcctc aatattgatg aggtttcatg taacacaggg aagaatataa 

69361 tataattatc tccatacatt tgtactcgac tccacgagat agaatcacct ggacggttga 

69421 tgcacgcaca tgcacttgat ttctgggtaa ttgtcttcat gaattacgaa actatctcct 



69481 ctaaaataaa ccttcactgc atcatggtca aatacaagag agtatcaaac actggtattt 

69541 gagttgtatt gttcttctcc tcacctgaca tgtttcaagc atacagctcc caaacgacgg 

69601 tggtatagaa acaacatcac atcgggcgct gattcctacg ggtcccaggg ccacaacaat 

69661 agttcagatg ctatcatgaa ggcaggacca gttaggatgg gcttgtttaa atccccactg 

69721 tggcttacag aacatggtca gaccgttaga gagagcgctc tatcatgtcc ccaccagagc 

69781 tttctatatc tatcctcggc tttccgacgg ggctgtgcat attatattcg agtttagtga 

69841 ttgttagtga agtgaactag accggtcagt tgcatactta gcatggtccc gaatcgtcca 

69901 tgaacatgac atattggtgg ccgtttgcca ggagtgtggc agtattgtca tacaccgatc 

69961 aaggaagctt tcaagataag attcccacag agtctgacct tcaccgaact gccactgtaa 

70021 aaatatgtct tgaatgcaca gttgacgctt tgccaagctc atcgccccct cgtagaccat 

70081 taccccccgc gacatctctc gcatcacccc aacaagcctc gatgccaaca agccaagaaa 

70141 ggttagccga cactagggtc atctgctcgt cttctagtcg tctagtcatg ctgttaggcg 

70201 tcatagcagt gactcatatc cgactggacc tcagcctgat tttgagctcg agccaccagg 

70261 tacctggagc cagggctaaa gactgaagcg ctcacggccg atacggagtc cgtcggcaca 

70321 gtgaaccgaa ccaaagtatt agatagcttt atcttaaaga taaccgaact gggcaggtgt 

70381 agtagtgaaa tggcactatc atggctttag agaaatgaga gtatgggacg acttgtcgct 

70441 agcttgggtt ctcattatta gggaaaatca gcggagagtg gactgagcgc agggtagata 

70501 gatagatttc tggtggactg gattggtctg agattttgtc gaggctagta gcgtgtgata 

70561 tggtggaccc tgatttgact tctctcgtga ggtgagcatg gcattttaga gttctccaac 

70621 cgctgctttc ttccgagttt gaggggttac tctccatgcg tatagatgat ggctagtatc 

70681 ccatgtcctg catcaaaatc cgtcatatgg tcaagataac aagtccagtg caagtccaat 

70741 acccctttca aggcgagcat gatagcaaaa gttttgctta atataacggt agcactagcc 

70801 cctcgatcca ccctatacga agtggatatc gcagttcacg gccaggatgc gaatcagggt 

70861 cttgccaagc gctctttgat ctctaggcga acgggccgat aaccacgaag gaggcgctat 

70921 tgtcagggat taaattgcat tgtactgtgc aaaggcccgt tgaactaggc tgtgaaaccg 

70981 acaaactcaa atgccccaaa tgctaacctg gaatagtagc tgttctgttc ttgttccagt 

71041 cgaaggatgg ttagctaacc gcgacggctt attaactaac tctagtcgac cgcgcggatg 

71101 acagcttgtc attggataag ggggaacaag tccgcggtga ctaacaaaga agttttagcg 

71161 ctggacgaag aatcgggtgc ctccgcctgc atccgaggat gcgaggccac gccatcccac 

71221 tttgggtttg tcgtcactct ggctgggtgt gcagtgttga acagcattta gttgtgatct 

71281 caatcgcaaa tggacaggag atgatattca tgtcataatc agagtgccat gtcagcgacc 

71341 atgtcttgcc aagagtcatc atttccggta aaaccgacgg aggtaaaacc ccggtcattc 

71401 actagttcga gggagggaat agtctcgcag gtctcttctt ctttcttcct cggggtggcg 

71461 gatggatcac ttcttcattt gatggttgcc tgccaacgac ccacttgact aggtttcccc 

71521 cttgatggga ttcgttggtt ttaccagaca gagcctcttc acagcccagc gcgtgttccg 

71581 caaaagcaac agcccaaggg tctgaaggct gacattggag cgggtctgac gcaggatggc 

71641 ttgctgttaa aaaaactcag gaacaggcca agacgaagcg accgcgcaag gctgagcagt 

71701 tctccgattt ccccgtgccc ctcctgcgcc tcccatgggc gacttttgat cccagatcca 

71761 tcgtttctgc accgaccata taagaagctc cgcaccccca aactcccctg gcaagttcct 

71821 gcccccaaat tacgtgagac agggaaccat cacagcaaca ccatgaagag caccactctt 

71881 ctttccttgg cctgggctgc ccagtccgcc tattccctct ctatccacga gcgcgatgaa 

71941 cccgctactc ttcagttcaa ctttgaacgt cgtcagatcg ccgaccggtc ccgtcggaag 

72001 cgatcgacgg cctcggccga cctcgttaac ctggtatgtt cccatcccga gtctcaaatc 

72061 aggggaatta tgacggtcgc taatgcaggt ttctaggcta cgaatcttgg ctacacgatg 

72121 aacctcacac tcggcactcc cggccaggaa gtcagtgtga cgttggacac cggcagcagc 

72181 gatctctggg tcaatggggc caactcgtcc gtctgcccct gtaccgatta cggctcttac 

72241 aactcaagcg cttcttccac ctacaccttc gtgaacgatg agttttatat ccagtatgtc 

72301 gacggcagtg aagccacagg cgactatgtc aacgatactc taaagttctc caatgtgact 

72361 ttgacgaact ttcaatttgc cgtcgcatat gacggcgact ccgagggtaa gtcttcgcta 

72421 ttccctcact ttcattttac actttgctaa cggttttacc atgcagaggg ggtcctcggt 

72481 atcggatacg ccagcaatga agccagccag gccaccgtcg gtggtggtga atacaccaac 

72541 ttccccgaag ccctcgtcga tcaaggcgcg atcaactggc cggcctacag tctatggctc 

72601 gatgacctcg acgaaggaaa aggcaccatt ctgttcggcg gagtcaacac cgccaagtac 

72661 tacggcagcc tgcagaccct gcctatcgtc tccatcgaag acatgtacgt cgagttcgcg 

72721 gtcaacctga cggccgtgca ccttgagaag aacggcaact ccgtctcggt caacaacagc 

72781 gccacgcaat tccccatccc cgccgtgctg gacagcggca cggccctgac ctacatcccg 

72841 acctccgccg cagccagcat ctacgaggcc gtcggtgccc aatacctgag cgagtacggg 

72901 tacggagtga tcgagtgcga cgtcaaggac gaagacttca ccttcctgtt cgactttgga 

72961 tccttcaaca tgagcgttga catcagcgag atgatcctcg aggccagttc cgacatgacc 

73021 gacatgaacg tttgtacgtt tggcctcgca gtgatcgaaa atgaggccct gctgggcgat 

73081 accttcctgc gcagcgcata cgtcgtctac gatctcggaa acaacgagat ctccctggcc 

73141 aaggccaact tcaaccccgg cgaggaccac gtcctggaga tcggcaccgg atcggatgcc 



73201 gtgcccaagg cgacgggggc gacggcgacc ggcgcggcag ccacatccac ggcctcgagc 

73261 gacaagtcgg acaaggagag ttcggctaca gtgccgcgca gccagattgt ctcgctggtg 

73321 gcgggagtct tggtcggtgt tttcttggtt ctgtaaatat agagatttca cgttgcatgt 

73381 tgatgataca taccatagat ttgcttctaa ttgcccctta catctcgaaa tgttttctgt 

73441 ttttagcttg atatcccatc gcttcaggtt tcagatatgg aaatgaaatg aaaatcggaa 

73501 tgcaagatct gatcgaatag ttgtttcact gccgcgtggc tggtccgatc ggcacgaagc 

73561 cagtaatcga caaccaatca tacattgtga tttcctgtca acctccttgc gactactact 

73621 aattttatca tatcatgccg ttgtgaccta tcctactcgg tgtccccaca ccgatgatag 

73681 ccacccgcga ccggataatc ggtaacctgc caagagtcga acggctttcc tgtggaaagg 

73741 gtggctgcaa cctgtttgtt tcagtttttg tgttgcagaa ttgactatcc ttctgacctg 

73801 tgaatgatac tgggatggtg acgtagtgaa aatgttggtg gagcgattaa ggtttgtgcc 

73861 tgtcagcctc gtcaagtcca tcatcatgtc ttttccggac tatcacttcc tctactctga 

73921 agtaccggga ctccccgcca gtattcgttt cgccaatcag agcattcgat gtgaaccgga 

73981 ttgaagagtg cccggtgacg gtcggaagca atcacagagt gtcctcttcc gttgccaaaa 

74041 tcacactatg gaaaaactac tgcaggtgag aagaagtata tctacgcgga accgagggga 

74101 gagcgaaaag aagcaacgct gaggataaat gagaatccac ctccaacctc gtcccagacc 

74161 aatctgcccc tcctaccaaa cgagaacccc cagacgatcg ctatgcctat cctcccccgc 

74221 cttcatctct tcgaaatcgc cgatcagcca tggtgtcccg acaaggccat cgaatatgtt 

74281 cagctatgcc tcacccactg ctggaacctg cgcctgccac cgatcgccaa ggcttcttca 

74341 gccgacgttg cctgcgacgt ccttgctgag aattttcccg acatctcgtc tttcaccttt 

74401 gtcgatttgg gctcgggcgc aggaggccca tcctccaccc tggagcgact tctcaacgcc 

74461 cgcctgcgcg cccagcatct ccccccggcc cagttcctgc tgaccgacct caacccccat 

74521 ccgcgcgaat gggctgccct caccaagcaa caagaaaaca tctcctacat ctctgaatca 

74581 gtggatgcga ccaaatgtga tcgactggtc ccacagatcc ggaaggaatg tcgaatgttc 

74641 aacgtggcgt ttcaccattt cgatgaccct cttgctatgc ccatgttgcg gagcgcgatt 

74701 gaatcggccg atgcattcat gtatgtcatc aagccccata catacacata ccttgacctt 

74761 ttgcttcgct agtatggata aacactgatg acgagaatta agtattttcg aactcaccgc 

74821 gcgtgatttc agctcgatgc ttatattacc tggcttagta ctgatcgcat tccagtacac 

74881 tctgcttcga ttctggcggt cacccttaca tttgctcttc acatttgtgt tgcctctggc 

74941 acctctacta atggtctttg atggctttgt gtccatcatg cgctgccgga ctcctgatga 

75001 gctgcatgat ctagtcagac ggtcggatgc ggctggcttg gagaactggg agtttcggtc 

75061 tggcaagtcg cctgttatgt atccgactac aaatgtgcat tggtttatgg gggtgaagaa 

75121 gacagctcag taattctgct cagagatggg gaagctgtaa tcagtcaagt ttaattgcta 

75181 tgtggtgggt tgatcctatt ctagtatcta actatctagt acccacacag aacttgtgat 

75241 ctgtgtttca acgatcaaga tgacgaggtg gcttagtctc atgagaacta ttgcaaaagt 

75301 gctgcaacca tcctcttcca aagtgcattc attcaaatgt ggtcgtcaac caggacaccc 

75361 tatagggagc ctcgtcagtg taggaatcag cggtcctggt ccacgcctat gcttttatgc 

75421 aaatattgac cacagagcga aggtgagtga gtcttcagca tggctgcctc caagtagctc 

75481 agaggtatac acctcgacag ccgatgccga gaccctaaag ttcactagcc gttagtgagt 

75541 gaatagtcac cacatcgacc catcattggc tgcagttagc ccacgtgctt atcctgcttg 

75601 cagatgagca cttggactac agggcattca tcatacaagg tcgcgtggct gacgcagcca 

75661 aatgcaagga acccgggaaa ggatcctatc cagcctgggg tacgacttgc gtctccacac 

75721 cccatgtaag cgagatgatg gctgtagccg agacatgaca gacaccctat ttcgtggtgc 

75781 cgtggctgat ttcaatcaag aagatgccca cgtgcatcga tactcttcag cagctggtac 

75841 actcaaacgg tatctgagca tgacatatac acaataatgc acaaggcagt gatgggcatg 

75901 tatacggctt tgcatttcat tgcttccccc cagagcatgc aatgcctaca aattggcacc 

75961 tcgccccctc ctgcctggcg ttaatgctga ctgaactgtt tcttgcaatt aaagtgggag 

76021 tatcaatatg cacccgacta ttgctaccct ttctatactg gcagtagcct cactagctga 

76081 ggctgttgcc attgaagctc gctcgcagaa cgccactcca ctgactccca gtccctcggc 

76141 taccacaacc gggattgatt ggtttcagac cacgcctgaa agctaccagg gtatgcagtc 

76201 gctcctcttt tggactctat atcaagctga cttcagaagg cacgactgca accggcgttg 

76261 ctcccttctt agccgagaca aaccctgctc cgtttggaca aaagaccctg tctccaaatg 

76321 acccgttgga aacaagtgaa ccgatcgagg gggctgctgg tcgcaatata ttccattaca 

76381 tgggcaactt aaggtacatt tcctagtgat gggttcccca tggccctcaa tgttaagccc 

76441 attactaaac acatacagcc cttaccatgt gcccgatggc ttcggagtgg atgaatatcc 

76501 tctcccaaaa ggatccaaca tcactcagat gcatatgatc catcgccacg gctcccgtta 

76561 tccttcatcc tcagaaggtc tggcctcatg ggcccagaaa atcatcaatt ctaccgccag 

76621 tggcaacaac ttcacgggcg cactttcttt cctaaacgat tgggactacg gactaggtct 

76681 agaaatcctc gtccccaaag gccgccagga gctctacgac agcggcgtcc taaacttcta 

76741 caactacgga cacctttaca atgcgagctc cccccataag ctggtagcgc gcaccaccac 

76801 ccaagaccgc atgctcaaat ccgcggagaa ctttctcgcc ggcttctttg gccttgaatg 

76861 gaccgaaaaa gccaacctcc tcccaataat agaaggagtt ggctacaaca actccctaat 



76921 cgggacttac tcctgcaccc gtgcactaga gtacatggca tacaacgcca ccacacccct 

76981 atcaacatgg aaaaacatat acctcaaagc gcgcaccgaa gcgctccgca ccctgaccgg 

77041 atcctacaac tggaccacca ccgacagctt caacgcgcag gatatgtgcg cctacgagac 

77101 catcagctat ggctacagcc aattctgcga gcttttcacc ttcgaagaat tcgaaaactt 

77161 cggctacgcc ttcgacatcg agttcgccaa catggtcggg tttgcctgtc ccgcgggccg 

77221 cgcgcaggga attgcctggg tggaggaatt cctcgcccgt gtggagggcc acttactcca 

77281 gactacaggc actaatgcta acatgacctt ggacaccaac cctgtgacgt tcccaacgga 

77341 ccagaatctt tacctggact tctcgcatga tgccggtata gttgctgttc ttacggcgtt 

77401 tgggttccgg cagttcgccg aatccctgcc tgccactggg ccccctctgt accagcagtt 

77461 caagagcagt aaaatcgtgc cttttgctgg tcggacgaat attgagatca ttaaggcgcc 

77521 gaatcaggtg gctgctattc gtcctgcggg tgatcagagc gatgcttatg ttgatggtac 

77581 tggagagacg ttttatgttc actttttgca gaatcagcgt acgcttcccc tgcattctag 

77641 ctttgaggaa tgcgagtatc gcgatgatgg atggtgcgag ctttcgacgt ttatggctgt 

77701 tcagaagaag ggtttggaaa ggtctcagtt tgagtatgct tgttttggga attggactgt 

77761 gactgagtat gggactgtta ctgatggtgt tccggtttga gatgcggcga acaaaatcat 

77821 gtatcataga gaaagagaga gagagagaga gagatagaga gtacaagacc tttgaatggg 

77881 agaagctagt atgatattct cgttttaata tatagtcaaa catgaaggat atgtgattct 

77941 tggaagcagc agcagcacta tatagtgaag ttgcatcgtt gtatacatta gtaatttgga 

78001 actggatggt ccgccgcttc aacacctgca tgccagcgat actttgcagc cggatgatca 

78061 gccggcgggc cgctctggcc ttctttgctg taaagatttt cacggtacgt tcccttcttg 

78121 acagcgtagt catcccagaa gaggcctctc ttacgaagtt cagggattag caggtcgatg 

78181 atatccttga aagaaccggg tttgatagca tacgcctggg acctgttagc caagcacatc 

78241 ctccattctg tcgagcaact caccaaattg aatccatcca catctgcctc attcacccac 

78301 cgttccatct catcagccac ctgctccggt gtccccaccg gtgtcgcgcc caatccaccc 

78361 accgttatat gctggcccac ggtagcctta gtccacttct cgacttccgg ggtagccttc 

78421 gaccagccct caaccgcaga cctatcacgc acccgttagc atctctccct ccaccatcca 

78481 cttaggatga taacataccg aatagcatta ctctccacat gcctcaactc ctcatcatcc 

78541 ccataccgat ccagatcaat gcccgtccac cccccaaaca gcgccaacgc accatcaatc 

78601 gaccccaagc tcctaaaata cctaaacttc tcctgtgcct cctcctccgt ccttcccagc 

78661 accggacaca acagcgccag aaactttatt cccgcagggt cacgcccgaa ctgcgtcttc 

78721 gccgtctccc gtatctccgc gatattcttc gccaccacgg acggactatg tcccgccacg 

78781 aaaatcgcct cggcatgctg tgccgcgaac gccttcccgg ctttagatgt ccccgcttgg 

78841 aggatcacgg gcgtgcgttg cgggctgggt tggcagatgt gcgggccggg cacggtgaag 

78901 tatttgccgg agtggttgat ttcgcgcacg cgtgttgggt ccgtgtagat tccgcgggtg 

78961 cggtccagga ccacggcgtc ggatcgccat gaggattccc atagtttgta ggtgactttg 

79021 atgtattctt cggcttgggc gtagcggtcg tcgtgctgga agttggttag tacgaggttg 

79081 tagaatgatt tatgggtgga gctctgggtt acctggggtt gctgggtgtg gcccaggtta 

79141 cgggctgcgg agtcgagata gccggttacg atctggcttg cttagttttg tgattactat 

79201 tagctgggtg gggtgggtgg gtgcttacgt tccatcctat cctggtttat gttagtattg 

79261 gtggtggtga ggatggtcgt gtaggaagat gacttaccgt cccttagtga gatgatcaat 

79321 agtggagaga cgacgcgcaa gatgatatgg ctgctcgtaa gtcgttgtga cagtcactcc 

79381 gaagccgata ttttgtgttg cggccgccat ggcaggtacg actgctaaag gctcgttgac 

79441 gggccattgc gcgccagaga taatggctgg ttcgaggttg cgggggcctt tgtacacgtc 

79501 gtagccgcct gggaaggtta gtatgtatgt ttcaaaggtg cactgaggtt cagagttacc 

79561 taagacatct gcgatgaaga ttccgtgaaa cttggcggat tctagaagct ttgcgagctc 

79621 aatccagtgg ccgatgtcgt tgaatcggtg cgattcgtcc tctgggtgtc gccacagccc 

79681 tggggactgg tggccactac ctggggtatg ttagccgatg aagagaagag aagtgagtag 

79741 gatgatgcgt acacatttcc acaaatgcat tgaggatgag ttgtttgcgg ggtcgagatg 

79801 tagcgttgtc gatagccatt gtggaggata ggtaggaggt taggttagtg acaagggtgc 

79861 tggcttgggc ttatccgaac cggcaaaggc tgagtcttca atatgaaaaa taaaaaagga 

79921 aggccagtta attatagagt ttctggctca gagatcgact gtgaataagt atggaccata 

79981 agtgtgcaat tcagacctcg ctggagggga tattctgtag catcacagtg ggccgggatg 

80041 acgtgatggg gaaatgaggg gtcacccatt cacggggatc ccatattcaa tacaacccca 

80101 aataaaagca gggatcattc tgaatctctt tatctgaaaa ctggactcat cgatttgtgg 

80161 tagagattta caagacctca gtagttggac gaatacttgg cacaagccac cttcgggctc 

80221 ccggggcatt cacccctgcc ctatctttgg tcctgttccg ccaactatcg ttgctgtcac 

80281 gcaataactt aactttattg accttacgct atacggacaa tcatgcacgt atatcggccg 

80341 aataacttta aatttagata tattatcccc aggatcttgg aacacggtcc aagcacctca 

80401 tatgcatagg ctatcgtatc atcagggaac accatgttcg cagtagaata caatgatcgg 

80461 cggatagcgt gatagttatg ataatgtgtg ctgtgatgta gatcttaaag gtacatttgc 

80521 cggggtgcct ccacatagtc agccttgtct ttcttgttag ttgagtgctg cattgcctgg 

80581 ccaccggggt ttcttaacaa gatgtcagag attcaagctc gcgcagcagg agcatcagtg 



80641 caaccgccac tgtcccaggc tgttggatat gttatagtcg tggtgctagg gctcattatc 

80701 gccggtggta tgtccaccaa aaatagacga gtgatgcagc tgacaaagca gtgatgatgg 

80761 tcatcaccaa gatccttaag aaaacaacgg gcgaggataa caagaagact gagatgtatg 

80821 tccaaccgca ctcatgcatt tcacatgcta attgctcagg ttcatgactg ccaatcgcac 

80881 cgttcggacc ggcttgacag catctgcagt aatttcggta cgagtctgcc ctcggtagcg 

80941 atataacccc tactaagtaa acagtcctgg ttgtggacaa cagcaatgct aggagcatcg 

81001 tttgtagggt atgattatgg cgtggctggt cccttctggt tcgccgcagg ctgcagtccc 

81061 atgatcgtct ttttcgcctt gatcgggata tcctacaaga gaaagattcc agatgcacac 

81121 acatcgctcg aagtggttcg gatccgatat ggtaggctgc atgaaagcta tacttatcga 

81181 cttgatattg acgtgttcag gtcgcattgc acacgcggtc tttatgaccc tgtgcttgat 

81241 aaacaacata tttgcctgcg ctaacatgct tttgggcgct gctgctgtta tctccgccat 

81301 gtaagtgggc caccagattg acgaacattg ctgaccatat agaactggga tgcatatcat 

81361 tgctgcgaca tttttactgc cagttggtgt tactgtgtac accttcgtag gcggcattaa 

81421 agccacgtaa gtgccctgct gaggaagatg tacactatgc ttaccggaga agattcctta 

81481 cggactactt ccacacggcg atcattctta tcatcgcatg ctatctctcg gtcaaagcat 

81541 tcactttcga agaggttggc tcaattggta aattatatga gctcgtccaa gctgctgcac 

81601 agcgccaccc tgtgtcagga aatcaagatg ggacgtacct gaccatgact tcgaagggtg 

81661 tatgtactat gctccatgct gttgtctccg ctaacaaacg caggctatcc tgttcgggat 

81721 acttcatatt tgctccaact tcggcctagt gatcgtaagt ataaacacac cgacctgagg 

81781 tatcattaac atttctgcca gatggacacc agctacttca tcaaagcttt ttccgctgct 

81841 ccctcctcag tagtacctgg atacaccatc ggtgggattg cctactttgc cattccctgg 

81901 gctctaggaa ccattatgag ctcccttgcc ttaggtcttg aaaacacagc gtccttcccg 

81961 acatatcccc gcgtaggtac tcttcacctt tgatcccagg agatgctaat caatgcagcg 

82021 catgacctcc acagaagtga gcaacggcct tgtccttcca tacgcggcca tgaccattgc 

82081 aggaaaaggc ggggcagctg cagttctcct tattaccttc atggcagtaa cttcgaccct 

82141 ctccgcgcaa gtcatcgccg tcagctctat cctaagcttc gacgtgtacc gagaatacat 

82201 taatagggcg gcatcagaca gggacatcat ccgtgccagt cacttcggtg tcattttctt 

82261 tgctgcgttt tctgctgggt tcagcactat gctccattac gttggtattg acctgggctg 

82321 gacactatat atgctcggta agcctccact tcaaccaagg tcggtccaaa tacttatgga 

82381 aataggtgtg gtaacctgtc ctggaatctt tcccatggcc ttcacaatcc tctggcgtcg 

82441 acagagcaga gcagccgcca tcctgtcccc aattctggga atggcgaccg gcatcggcgt 
82501 atggttgggt acagcacagc acttctacgg tgccgtgtct gtcagctcaa cggggcaaat 
82561 actaccttgc gtatatggaa ctgtagcgtc tgctttctcg ccgatagtgt actccgtcct 
82621 gattacccta gtcaaaccgc agaggtatga ctgggcggag ttccgcaagg agaagctggg 
82681 actcgagcga ctggatagtg atagcgatat cacggtgaat ggtcaaggta gtgaagagca 
82741 gcaaaatcgg acctcgtttg atccccagga gctgaagaga tgggggcgca ttgctgcgtt 
82801 ttggtccatt gcgactttcc tgggccattg ggtgttgtgg ccattgccga tgtacggatc 
82861 aaaatatgtt tttggaaagg gagtgagtat ttgtcactgt tgcttcttgt gtgaatgtcg 
82921 gtggattgct aacagatttg tcagttcttc actgcttggg ttattgtggg gattatctgg 
82981 ttgtggatta ccatgctcgt ggcgatattc tatccccttc tcgatggggg aatgcagcag 
83041 atgttgcaaa tatcgcgggc actgcgggga aggagagagg cagtcacatc cacatcacta 
83101 ccatcagtga ataatgattc tcccgaggtc ttcagagtga atgagaagtc ttgaccgttg 
83161 tgtcaacttt tgtggaagat gccgagttat agggagatgc ctatgattgc taccctacgc 
83221 aggatcgaca gcaattcacc aatattaggt aatgattcat tacagtatat taaacgccta 
83281 gtataaaagg gccccgtctt tcgtcgtttg gttcacaagc acttcatcat caacataatc 
83341 gagtctttca tttatatata taaaccaatc ctttagatgc tatcatcctc actttacacg 
83401 tttgcccgaa cagcaaccat gtatgctgat gatcccctta gtctgtcagc gcttcccaga 

83461 caagtttatt cattgatcga gacattcatg gttctgcttt gcggtgtctg caaaacaggt 
83521 cttccaaaga gctactgttc agtcgagaat ttgactgaat ttgcttgatc tttcgagaca 
83581 ctacagtcat tgtaacaagg cacccaccag atcatccgtc atttccgcta cctacgtggg 
83641 cttttctcct gttattcgta ccgagactgc ctattctgac tagccagagt cattttggtt 
83701 ctgaaaccac ttccttaacc tcgatatatt gaattcgttc aaactagccc cggaacttgg 
83761 gaattatttt tttaccatgg cagccacact gccactcgag ctactccata taattggaga 
83821 cagattacgc cagaaaggag tgccattagc tccctgtacc actgtcagcc gctggtggaa 
83881 ggacgttttc gaggcatata tatacattcg ggttgtggtc cgcagcacaa aggctgagtt 
83941 ggaagctcct cttcccggta tacctgtcga tgtattcaag agtttggcct cacaagatgg 
84001 aaatcctcga cattgccatc aatgacatac aaatttcccg aggtatttag agtgaacggg 
84061 aagccttgac tatcaagttc attttgcgca aggtctcgat gtgctggtag ttcccacggc 
84121 tgctgggaca gagatatatg ttacaaagga tcagccagct atcttaacta tgcgtgaagc 
84181 cagaggtatt gagaaaattt ggatatacca gttggcattg tgaaagacat gtggactttg 
84241 ggcaggcata ctctgtacgg cttactaact ggccaccgaa ttagctgcag cgctggcagc 
84301 gattcgtcta caggactata catatcaggg aatagcaacg atgctcatat aatgccaatc 



84361 tttagcttga gctgtttttg ctccatatgg ccttaagact ttccgcgctg atgcgttttt 

84421 ttttttttcc atcttcgaca agaacttcaa aatctcacag ttttgtgaca aaacatttca 

84481 atgaacgaac atttgggctt taaagtcgag aagcccggat ggcttgactt tgtgcaggca 

84541 gacgctggaa ttcgcctctc gtgcccacat taggcgagga aaaggtactc agtgtagggg 

84601 tcagcggcta cagttcgctt ttaagttaca cttcatatat atgagatact gtataaaatg 

84661 tgaatcttaa tagtttattt gaaaagtttc taggtagatg acaaccgccc aggagcttgg 

84721 aaactacttt cacaatcact gtcacagtgg ttcagttcag accaaggccc gaaatacaac 

84781 cgaagatttt cagacgggac aggtcgatat atacgtccat tcgtacatat gtcacggggt 

84841 caagtattat ggggtggttt taggtgtgca cgataatacg catattctca ccaacaacaa 

84901 tcgaattgcg tcgctatcac tcctcgacag gatgataacc atgaagttgg ctacaaaagg 

84961 caagaccaga gatccgagaa cgaactcgat ttcgcttact gaggcaccct atcagtagct 

85021 gttcatgact tccctactag atactagaag ccacacaatt ggttgatggt aaggaagtgt 

85081 agggatttgg acaaatcaca attctgtctt ctccatccat tgagcaagac ctctcttttc 

85141 ctgcggagaa aggacgcact tccgtctcgc aacgcgtccc ctcttgcggc aattctccag 

85201 tccgaaagta cgtcttaata taatgctcct tacacgccaa tcgggaagca aaagagcaat 

85261 gctacagcga acgtcaataa gattatcgac gatatacaag gaggaaccgg aagaacttac 

85321 accttcagcg tgctagactt catagatact tagtgcttaa cagtctgcga agatgtatga 

85381 agtcctcacg cgggataacc tcatataagt tcatgcgaat ctcaatgggt gcctgaatgt 

85441 atgaggttgc gaaaggtact acctggcgta ttaagaagcg cagccggagg gccgaactcg 

85501 actatctgtc cctggtcgag gaccagtacg aggtcataat cctcacgaac agcgtctttc 

85561 tttaagcgat ggataacctc gagcacggta tgtttttgca ggctgttgcg caggaggcat 

85621 cgcatggtac gttctgtctc ctggtcaaca gaactggtga cttcgtcgag gagaatgacc 

85681 ggaaacttgg aactttcctg acggagacga caaagcagcc tcgccaagca gaatagttgc 

85741 tgctgccctg gtgtgagtga cccgtcggta tcaatgtcaa gatcgagagc cgatttcggt 

85801 atgggaaagt gggcccctgc gtccgagagg ctttggtggt aaagctgctc gaacacctcc 

85861 cagatgccca cttgctgcaa ggactctcta gcataggtgt caaagtcctg tgaatcagca 

85921 tgccggcccc aggggtccat attatagcga accgaattgg cgaagagtaa aggcttctgt 

85981 ggcaggctga ggaaactgcc tcgaatagaa tcccgactga agcctgtaat gggttcatcg 

86041 ccgacagtca ccacgccacg ctgaatgtcc agcaaccgta gcagcgttcc cagaagagca 

86101 gatttaccac taaagatgaa cagtacctgt cagttactat tgtcgtatta ctcagttgaa 

86161 ggttgaagcg gcagcattgg cttacctgcc agttcggcca catatggcca ctttctgtcc 

86221 ttcccgaact gaaaaggtga catccttcag gatcatgtcc ccactagttc tatgaagaag 

86281 aatgaactgt cagtttcata tctcacaatc ctcacaaaca aaccgagtca tacccgttga 

86341 cggcagagac gtcctggaat gcaacgttgc cgtttgcaaa ccctgcttta tgttcaggcc 
86401 ggttggaagg gtcatcttcc ggtgtcatac tgatgttaaa ttgacgaatt cgggccgcag 
86461 ccccgataga agtctccaaa tcgacccaaa aagtcataac ataacgtata gtagaactga 
86521 gtgaaatcta attcgagggt tgaagatgtc cccgattagc ttcctgcgtc ttaaatggtt 
86581 cgaagggaag aggcaaggca tacaaggtta accaacgcta gacccagaaa gccaactgat 
86641 cgttgtgatc gcaagcctac tgcgaggccg gtgaccgcta ctgcaaggat catgacgata 
86701 aagtctagga tcattttcag ccagacctgt atggaaagga gcatatagtt ggccgcctga 
86761 gaagtctgca agagttgcct ggcctggcat tgaaactcat ctattcgacc aaaagcccga 
86821 acggttaaga gaccagaaat ggtgtcatta acaaaacccg agagcggagc cttggactca 
86881 atgtccatga gccggagctg cttcgaggtg cgcaagtaaa acctctgcaa cactcacaga 
86941 cctcctacca ggaatgggac ggagatcacc aggtacttag aggataagag aacaatgatt 
87001 gtctcaccga gagcggcgca aagtgctaaa agtactcgtg agtggaatgg ctcctgttaa 
87061 tggaaagaga aagtatacct gtgaaggtgt tcaaccaagc catggggagg tcgttgtcca 
87121 acaacatgat gtcttgaata aatctgtttc gagccatatt aggaaagctc aggtattgtg 
87181 gatatcaaat acctgtttgc gatggaatca gaccccttgg agattacgta ggagactggg 
87241 cccctgcgag acgcagttag ttgatgttga cgcagacacc aagatatgga agcagcacct 
87301 tatcagactt tccaactgat ccccatagat gccaatagaa gatgaagtca tcagaaactc 
87361 gaagaaaaca ctgttgggaa accctggttg taagcgggta accaatatag accggagagt 
87421 aatgaggacg gaccagcaag ttatatagag gaggatcaat cctcctacta ctaagcctgt 
87481 gaagatacca acataataag ccacgtggtc tggatgtttt gactctgccc aatcctgcaa 
87541 ccagacacct cgaggaatgt tagcgctcgg tctgtagtac ccatggatga ctcgagaact 
87601 gaagtatata tcttactttc gaaggagcgc tctgccccaa ttgccattgc cagagccaca 
87661 aagaccagaa agccggtgac actcattgag cgggcgaaga aaagataaat cgaccagtct 
87721 gatggggctg tggccgcgct ttcatcctca tcaggtaggc tgcgagccac tgaatccacg 
87781 ctagccacgg tgatactttc tgtaggagtg gaggctgttt tcaaattctc aaagttatca 
87841 ttacccccca ggtttctgtt tccaatcatt tcttcaatat tggaagtttt cttagatgaa 
87901 ccacgaaaga ggatacggcc ttggtccata gtaatgacat catcaaattt ggatatgcac 
87961 agatctgagc ccggtattag cttcaggaat atcgggtagt cgggagcgag gagaggggag 
88021 gaaggatata ccggagtgag tggcaaggac aacagtgcac tcgaggcgct tcaggagacc 



88081 aaatggggag aagacagctc cgataattcc ttgttccgtc ttcttatcca ggccgctcag 

88141 tatgtcgtca gccagaagca gttggggccg ggagtacaga gcgcgggcaa gcgactaagc 

88201 agagtcagca tgagacggaa gcatacgaca caatcccact cacaattctt tgacgctggc 

88261 cgccgctcaa tctcgagccc ccgtggccga tgatcgtgct gtccctctct gatagactgt 

88321 cgagatcggg aagcaggtga acagcctcca gcacctgctg gtaccatcca gaatccatcg 

88381 cacggccacc aaggatgtta ttgcgaatag tgtcgttaag aagccagggc tcttgagcac 

88441 aataagccat acgtagatgt ggacggacgt agctgtatcc ggacacgaag cagagctcgc 

88501 cgaggattgc tttcaacaga gaggatttcc cggaaccaat gctaaatctc ataagccaaa 

88561 aagtctataa actgaaaata tgtgcactgg aacagacctt cctgctacga tgaccgcact 

88621 accctgacgg atgacgagcg taaggtcgga tagcaaaacc tctccctcgg tgttcccccc 

88681 agagtaacac gcattgtgaa atttaatgag tgtgccacac tcttctccat ctagtgacac 

88741 agaagcatct ggtaagggtt tgcagttgag ctcaaatata tcctttccat gggcatcaac 

88801 cgtttcttgt agggacagag tgcccttggc ttgcatagat gagtgttctt ctaaaaagga 

88861 ctggatgcgc tgaatgcact tgtatgctga aaccagtaat ggggtttctt gtatcagcat 

88921 aaatagagga tctgttgaaa tctggagcag agccaaagag gtgaacatgg tatcagttgt 

88981 catcacttcg tcgcgcgtcc ttgacacaat agcaaacccg ccgtatgctg ctacgagagc 

89041 tatctgagaa gatgctgatg ctgggacagt cgataagata gaccagatta gcttggatca 

89101 ttaagcctgc tctgcagcat cggggagctt acacgttgtt ataattaagg ataaaacacg 

89161 acggaattta ctgccctgaa tattagctga cgatccatga gtctagtcca gtctccttac 

89221 cgtgccaggt gaacttcgac gatccttagt ctcttcaatt ccttcatata ttgggcggca 

89281 agtcccaaca tctttacttc cttgtagtga gacaacaggg tcatggttga ccggactcgc 

89341 ctatccgtgg ccgcgagcca tttgatctgc cgctcgccaa ttgggtcgcc atagagtgtg 

89401 ctcaataggg tcagtacaac ggtacttaat agcggcccta gaaatctgaa ctagttttag 

89461 cataaaaaga ccccgtatgc ttctaaatga tgaacttacg ctagccctaa cttccagtaa 

89521 atcatataca ggcttacacc gacggagatc acagcagccc agagatcatg aatcacacgc 

89581 gcaccctcaa tgatatgttc catatctacg ttgatcaggg tggtgataag gcctgggtca 

89641 gctttcgcaa ttggcaagcg gagcaacttg ctatagatgg catcaaccag aacgttgcgg 

89701 atctgcaacg ccaggcgatt tgacgcgtgt gcataccatg cgttgaacat ggatgtgaga 

89761 ccgtaaagca acgcgcatgc cccaatcaat ccgtaaccca cactcatccg ctcgctgtac 

89821 ccttcgccct ggacaaagtc tagtatgcgc tgaagtagca gaggttggac aagggtcacg 

89881 gcaatcatta ctccacgcgg cagggcaggt atcagcagtt caatgcgtag tgttagaaaa 

89941 agggctcgta gcagagaagt tgctttggat ccttgcacct gactcaagcg agaatattga 

90001 ccaaatcgag tattgtaatg gcccgccgtg gcggcatcat acatgccaaa gagatcgctg 

90061 tatttcaaag agcgcttgtt tcctttcagg gagaacgcgg aggtgaaaga tcagcttcac 

90121 agtctaaatg gactataagc atacggagag acctaccatc ccacagtgta cgcgctagcc 

90181 agaggaatag gccccgacca tagagcccgc ttagctgctc cctcgaaagc gttgccagtg 

90241 ccgcggtagg caagtgatgg ggcttcgtcc gggcctcagc cctggccaag acaacagtga 

90301 cggcgagact gatcgtctca acgatagcta ggccagtatc cctttgcata agccagagag 

90361 aacggatacg caccgcttga aagatgattg ttactgaaag gaagagactg aggaacagac 

90421 tgggtgcgac cgcggtgcgg tgcaatcgat acgaccacca cgctaaggga aacgctgtta 

90481 cgaagtatat aacctccgct gcgatactgg tgcggactcg aattggagtt ctcgaccaca 

90541 atatcagcga ggccaatcgt aacacagtgt aaagactgac tgccagctaa ggtgagttag 

90601 cctgtcgtca tccgcagtat tatggataca ttcggcaccc ctaccggttt gcaggtgctc 

90661 agataaggtg ttgcatgtaa gagctgcctc cgctcccttc cgcaaagcac gccaatgagg 

90721 gaagcgacaa tgaagatgat actgggcaga atgatcagga tggtatcctc aaaggtaagg 

90781 gtgaaatcaa agccaccgcg gcagctctgt gcccacgggc cccaggattc atcgatctgt 

90841 tcgcaggttg ctatcatagt catctgaatt cggtagtaat tccaagagat agatcaagag 

90901 attaatagac acgattgacg cagaaagaag cgggattata tatgacttcc gtatcagata 

90961 gataagccag tgattggcct gttatagtca caggtaggct gaaaatatac acagaataac 

91021 aagtggtctg gattcgaggt cttatcattg gacatccctg ctagtacatt gcaccattat 

91081 atctggaccg acggagaccg aaccccccaa ggagtgctcc ttggacccgg cgataacgaa 

91141 aatcacctaa tctatcccca taactctggg gattgtcaag cgggccagga atattttcgc 

91201 gacatctgag ggtcctacca ccaaaatgtg gatggagggg aggctgaaaa cctaattgtg 

91261 tgggacacct gaatagtgag taacccgttt gagtcaacca aaaatccatc atgtcaaagg 

91321 tgtacaaagg ctgtaacatt gacccaaggc ctaccatatg cagcaaacag ccgagatctc 

91381 caaagttcaa agatcaaacg cataactagc caatggccag acaactcaga gatgcataac 

91441 gcatttcgtg gcaagacgtc attagaaaca gaacctccta tctctaaacc aacccagctt 

91501 cctgcttaac cgcctccagc caacacaaat catccgcata tgtctgattc cacggatcat 

91561 cactctcata cgcgaacagc ctcgcacgat cctgaatcat ctgatccgac ggcctgcatc 

91621 catttgacat ctgttcgcga ataaacgcca acagatcgcg gtgcacatca aaataccgat 

91681 ccaaagcagt cggcacatcc caccgaaacg gaatttcatc gtcctcttta tcctcctccc 

91741 gcggcatcgg aaacacatcc gatgtccgcc acggatcagg cgtctgcgcc tcacgcccaa 



91801 tcaaatatgg cggtatcgca ttctcaacga gccgctgagc ttccggctcg aaaccccaat 

91861 cgcccatcca ctgcttcatg tcggcaccgt gcttgaaatg ctctgcgaca tggtcgactc 

91921 tctcctgcca ggattggaac gttgcgctgc agaacccgca gcgggagagg atacggtggc 

91981 ctcggctttc gcgccaggtg agcatggctg catggaaagt gtcgacgcgg tgggttagtt 

92041 tcaggtgctg ccggaggttg tccttccgag tgaagatgcg ttgctcggat ggtttctcga 

92101 gacaggtact gtagttgtgt gctgtttcga gatggtcttc gtctgcgttg gcttgttgac 

92161 agaagacgca cacaggtgtg ccgttgttgt cgatgatgag tccgccttct ggagaacagc 

92221 gccattggtc gacgggaagg tgaagagcct tctcgtggcg ctgccagtcg tatttcgttg 

92281 cgaacgtgtc cgcgcagaac gtacactggt agggcctacg agtccactgc agacggtttg 

92341 catgacggat gtgtttccga cggtggcgac gagatgacgg acgatgaagt ggggttggtg 

92401 gacgttgtgg cggcctatgc gatgataagg cgaccgattt ctccgagctg gatgactggc 

92461 tgcgtccaca gctgctgatg gaagcatccc cgagaacgaa ggagccgctg gagtcgttgc 

92521 tggaccaggc gtactggggt acggtgtatg gcaagctgtc cgaaaaagcg gaaatatcgc 

92581 tatcggcgag agctcgcagg atgtcatctg tggctgcggc ctcgctttcc ggtggcgagc 

92641 tcctccagcg ttccatgggg gataggaatg acgggctctg acgggcagac atatacccgc 

92701 tggagcgctt acgccgtcga gcgttgataa accagttgga gacttgcgcc cgagtcagac 

92761 tggtgcgttg cgccagctca tccttctcct gggtcgtggg atatgggttc tccgggtggt 

92821 tgttgagcca tgacttgagg gttctcacgg cctccacgga tagacgctga ggagggcgtg 

92881 tagggggggc aggcccgccg ctatctggag atactgatgt cgcatctggc tgggatggaa 

92941 gaggagagaa gctatcggga agctccgacg gtagcgagaa gatggggtct acggtggcgg 

93001 gatatgaaac agcgtcttca tcatgcagcg gtggcaggga taatatatca tcacataatt 

93061 tagggaacgc ctggtgagcg tgctccatct ggatagtatg cggacatgct caccagagta 

93121 aggacctaga tcgatgtgaa agttgaggtt gcaagcggac aaaatcctct gcgccctgcc 

93181 attgacatcc gctccacttg ccggctgtcg gccgccttcc atggatcctc ttattcgctg 

93241 ctgactccga ttgtagccga gatcggagga taagagtgag gtaacctaat cattaagttt 

93301 tcaatctaag attcgaatag ctttgagtca ctcaattgta gccactgtac agttcatctt 

93361 cgcagttagg gttttcaccc tactggtgct tattttaacc tctacaaact atcaggaatc 

93421 gtaggaattg catccctcca tcgttgctcc aatatggcct ccacgtctac caccactgcc 

93481 gttgaatggc agcagggtcc cccgattgag cttgagtcca gggctcccaa aaatgccgtc 

93541 gcaccggtgt cagatccgac cgatgaggtc atggaggcct cccgtctggc cgacgccgaa 

93601 gtccctgaag gaggagcagg gtggatagtt atcgcaggtt gtgcagtgat tacctggtgg 

93661 ttcattggga catcctactg ctggggtgtg ctacaggcag cgctggtcaa ggacggagta 

93721 tcatctgcat ctaccctgtc atttgtgggc tcattagcac cagcgtgcat ttcgtttctg 

93781 ggaattctca atgctcgcgt gatccgcaaa ctaggcacca gaacgtcggc actcctgggg 

93841 atattcttac ttgggttggg agagatcctt agtggattcg ctgtccacga ggttggcggt 

93901 ctctttgtta cttctggtgt ggtgatgggg ctcggaacga ggtatgtcac agcactcgtg 

93961 acttattgta gtttgctaat aaagtagtat ttctttcatg gtggtctcca tcacaccagc 

94021 ccaatacttc aaggcaaagc gcggcattgc caatggtatt gtctacgcag cgggtggtct 

94081 cggcggtgct gccatcagtt tcatcctgga tgccctgctg tctcgcgtcg gcacagcttg 

94141 gacgttccgc atcctggggt tcatcacgat gggcaccggt ctccctgcag ctttcctagt 

94201 gaagcagcgg atacctatac ctccatctgc tttcgtagag tggtaggcac agcaataccc 

94261 accatataat tcgatcaaaa tccgctaaat ctaccccagg cgtttattcc gtgacatccg 

94321 attcctgctg ctctttgcag ccggagccat cgcgaccttc ccactactcg tccccccatt 

94381 cttcctgccc ctgtacaccg actccctggg cttaggctca gctgcaggcg ctggtgttgt 

94441 ggcagcattc aacttttcct ccgcgctagg aagattgacc tgcggatttg ccagcgatac 

94501 gattggcgga ctcaacacgc tgttcgtctc gctgttactc agcgcattga gcatgttaat 

94561 aatctggcct gtttcaacgt ctattggccc gctagtcgtg ttcgtgatca ttaacgggat 

94621 ggcgaacggt gggttcttct ctactattcc tacggttgtg gggaatgtgt ttgggtctgc 

94681 cagggtgtct gttgctatgg gtatgattgt tactagttgg gcaggggggt atcttttggt 

94741 gagtgcagtg ccgtgaatat gccttagcgt ttgcatacta ataaatggtg tttagggatc 

94801 ccctattgca ggttacatcc tcgatgcctc gggtggcgag gatgccggga tcaaagcgta 

94861 cagaccggcc attctgtatg cgggattcat ggctttggga gcgtcaattc tggctgcttt 

94921 tatacgtttg aagacagata cgcgactgct taagaaggtg tagcttattg tcgaaccaga 

94981 accaactgga atacgcttgg tgaactgaat agtagttatg gtctgttgag gaagaacgat 

95041 atggctatac gtaaagaaat acagtagccg agaggggcga catgcgtttg cgagacggca 

95101 attgaagaga attgccggtg caggtcccaa cagtattgtg ggattcttga tcaaaccagg 

95161 tttaccattg ctactagcta tcaatacgac aagccaagtg caaggtgtcg tccgttcgta 

95221 tcatctacca gagaactcat ccccactacc acagacccct aaccccaaag taccactaat 

95281 cagacaataa agtaacaaga taattggaat caaatgtgac aatactctct ccagtaaccc 

95341 cggagccaat aagcacaata acaaagtagc ccaagttgca ccacaggaat gtgacgttgc 

95401 cggatccggc ctcaacaccc accctcatcc acacccacat aatgcctatt ggcccgacac 

95461 ccaaccccag atccagtgtg acctgacact gcatcaaaaa tctcgagctt attggtagat 



95521 taaatctcat tggtgtgttc ctggatgatt tgataagtgg ggtcggtttc tatggaagca 

95581 tgagatgcca cattccgatg cgggaatggg aatgtagcgt agatgacaat aatcgatcaa 

95641 cacaactcaa tcttctttct ctcccttcca actctcactg ccgctcaggg agcagagcag 

95701 atctgattca acaccaccat caccatgttg ctgcgaatat ggcacatctt gtataccttc 

95761 atcgtagtcg tgcaggccat tgactacgat gttgacgatc caggtactgt ccctaccttt 

95821 caatgcacaa tcccagtgcc tacatacata cccgaacgaa ctagctaatt aacctgaata 

95881 gactccatca aagcagcctg ccactccgtc gcccgccaaa tgctaaccca ctacacgggc 

95941 aaccaacccg gcgacaaccc gggcaacctt cccgacccct actactggtg ggaagccggc 

96001 gccatgttca cagcgctagt cgactactgg tacctcacct ccgacgacac atggaacaac 

96061 ataacgacac aaggcataac ctggcaagcc gggcccagcg gctccttcat gccagcgaat 

96121 caaacccgca ccgagggcaa cgacgaccaa tccttctggg ctttcgcagc catgtccgcc 

96181 gcagagcgaa acttcccaga cccgcctcca tcctccggaa gcccagggtg gctcgccatg 

96241 gcgcaagccg tattcaacac tcaagctgcg cggtgggaca agagtacctg taacggcggg 

96301 ctacgatggc agatattcac gttcaacaat ggctggacgt ataagaatac catctcgaac 

96361 ggatgttttt tcaaccttgc tgctaggctg gccaagtata cagggaatag cacgtatgcg 

96421 gactgggcgg atacagtgtg ggactggacg ggtgaggtag ggttcatgac ggatacgtat 

96481 cggttttggg atggggcgga tgtgagcagt gggtgtgggg attggaatta tattgagtgg 

96541 acgtataata cgggggttta tctgctgggg gcggcggtta tgtataattt ggtgggtttt 

96601 tctctcccct gattgtgatt attggtggga ggtggctaat ggtgacagac tgaatcgccc 

96661 gtctggaaag cccgaacaga aggcattctc aacgccagct tcgtcttctt ccaagatgat 

96721 gtcatgtatg agcgggcatg tgaacctgtg agcacatgcc aggtcgatca gcgctccttc 

96781 aagggatact tggcgcgctg gatggccgcg acaacgcaaa tggcgccgtt cacgtatgac 

96841 ctggtgatgc cgaagctgag ggcgtcggcc aaggcggcgg cggagacatg cacagggggt 

96901 gaactccagg cgacgtgcgg gttgaagtgg acggatcgga agtgggatgg aatggatgat 

96961 gtggggatcc agatggcggc gctggaggtg atgcagtcga cgttgatcag tagggttgat 

97021 cctcctgtca cgcaggacac aggcggcacg agtcagggca atccggcggg tggagagcca 

97081 ggacccccgc cggcgccggt gccggagggg ctgaggttgg agatcacgaa ggcggatcgg 

97141 gcgggggcag ggatgatgac ggtgatgctt tcgatgatag ttattggatc cacgggatgg 

97201 atggtttacg agtaatggtt gggatcttct gggttatgag ggtttcatta tgtacatgtt 

97261 catattgtca ttctatttgg taactatgct ggtcacagaa tcactttcct acagggatat 

97321 actcgttgac accttggact gtgcacagag cagggtgtcc ctccctcgcg ctctccatcc 

97381 cagctaacat gttgtccagt gttttcgtat gtcccttttc gtatcctctg atacggaggt 

97441 actccaattg caacggcaaa tgatccatca ggacgatatt tcccaactca cccattccag 

97501 tagcaagaaa ccacagacaa cttactccca gactcaagct cctcagcgat ataaactcct 

97561 tcaacacact tttgtcctca gcatcatcct gcaccccgct ttcacgctcc acagcaacaa 

97621 attgagacag ctgatcgtca aattccaaat cgagcacttc caacgtggcc ctatgtctca 

97681 ggatagcctc tatcaaactg taccgatcaa agaactctgc aaagcccgca tagcagcccc 

97741 cctgtacaac tgagtacctc aactctttca agcatttcat cgaacagata accttgcaca 

97801 acagatccga cttgaaatcg caccccctca gatccaatcg cgaaatattt gtcgacgaag 

97861 gcggaagcaa accgtccaac gcagcactag aggcgttccc aaacccctga atagcctgca 

97921 tccgcagcga ttcgatactg ggaagggtac gcaccagctc aatacacaga cagaggtcct 

97981 ggggagtgca agcacgggac cgcgcaatat gaaagctgga ggcttctgtt gttagaatct 

98041 ggaagtgctt aacattagcc aagtagggga tctttctttc gcccttttcg atgagatgac 

98101 ttgagttggc atgtttgagg aacttggcta gtgggtactt ctcgatcttg gctgcgaaat 

98161 ggtggttatc ggaagggtca ttaaatcggg aagatgtgta ctcccaaaag ggaggggaga 

98221 tagcaagaga cgtcaaattg ggagaggatg ctatgaggag gaccgtcaag gcttggccga 

98281 tgaaggtgga tttgccgtgg taagcatcct cttgggtact ggctgggtta gatacggtgg 

98341 tgatttcaat aacggtaata gagaacgcac ttgtagtatg tgagatactg ccaatcctta 

98401 gtgtagtctg tctcctgcat ggccatgttg agtagtctct cctgttccgg accgataaag 

98461 ccggccctgg tggcggccgc acggagacgc tgcatgtcgc ctgcgtcaac agggcgatca 

98521 tattcccgcg gcttgtagtt cagcggagag cggacgggaa ggtccatctc gatgtgacgc 

98581 acataactgg cgaggttaca gtcatgcagg atgtcaagga ggatggtata ggcgtttttc 

98641 aggtccgggg tgcagttgat gcggacacag cgatatctgg ttttatttac gaggtcattg 

98701 ctgtggatga gcgtgaacag cgatggactg aggagagaaa aggagtagac atctccgctg 

98761 tgatcgagaa aggagatgac gtgcaagaga atgtcagcgg ggagaaccga gagggccggc 

98821 atgattgtgg ggattggata ggaacaggtg aggttggaat acacttgcat tactcgtcgc 

98881 gcttggaggg gtcttcggtt gcggcacatt cgggtggtcc cactcgcaca ggagggggac 

98941 gtgcgagtga accgcgatgg tctaggcctg cctctcacca tctcgcctgt tcgatcgagt 

99001 taatttaagt taatctgttc actattccat cggatttaga cctaatataa ccctttctct 

99061 ggataaaata aagttaggat ttcgcaaacc ttcccaggct actcgacatg tctaggaact 

99121 gtcagcacat tctactctta tttattcagt cagaacataa gatgtttatt tgcatatatc 

99181 gataatctac atctttggta ctaataatgc cagctggcgg gctctaagtg tgaggctgtg 



99241 aagtaaaagg acagtgccag acgcgctcca agaccgcgtg tactagcgtg tatttgatta 

99301 ctagtctcgc acttacactg accatatgac gagcgctaac catacaaatt ggtagaaact 

99361 caaaatacca ctgctgtata ccattcctta tgatatgtat ataactagtg cccatcggcg 

99421 tattacacag gtaattgaag agaaactcaa accttttctg atagtagctg tagcaagacg 

99481 cctaacccag agaggaaaag tgttcatgat ttcaaatgac tgtccccgaa tgtaattcgt 

99541 gaaggatcgc cagatcttcc attccctatg tgtcaggaat ttcggcgatg ggtgtccgca 

99601 tatctgggta cggagttgtg atatactaag tgacaacctg ccgatgtttc atccattgtt 

99661 atgagtatct acccaagtat tcgctttcct tcacagtgga gggttgacat attcactcta 

99721 tatgccagat ataagaaccc ccaatacctt ctattggcaa gtgaaggaga aactaaaccg 

99781 acacgtccta tcgtccatta ttatttgtcc ttgtctgaag catcgagcat gaaaatgttc 

99841 gggttttgtc gtggttcaat gttataagaa tgtctagtgt cgtaattctg ccttgcagat 

99901 tctacctatg tcccttctta tggtctctaa tatgtgattg tcagttattg tctttgtgcg 

99961 aaaacatcag agaattctct taatattatt acgtagaata ttgccctgct tcacaacgga 

100021 cctcttggta gaaatcacac tatggatagc ttgagaaagg acttcactga tgaacaatat 

100081 tgtggcaagc gacgcgacag gtttcccgag gttatattca tactcaaatg tctatgtgga 

100141 tcacctgtgg taagacagac ttcttacttc catgaaggca tgatatcggt ggcttgtcgt 

100201 gcgctttaaa tagagaagaa cactttgcaa gtgatcaagg aataatcaaa gactataaca 

100261 acatgattga ccagaagtat catggatggg gcaaattctg accccaggct cttagcagga 

100321 tatacttctt ttcctagatg gagtgcaagc caaagtgcgg gtctatggcg cctgatattc 

100381 cagaactgcc ctgtatctaa tcctacccgc tctcagtttc tcaatcgtct cttcaaggcc 

100441 gttcctggta agagggaaag tcataatagt cggtttgata tcaaaccggg cagcaaactc 

100501 caacaactct tgcagattac gccgcgaagt gatgaagcat ccctgaatcc gtatcccctt 

100561 ctgaccaaac ggaactaaag gtataggagt aggcttcaag gtcacagaaa tatgatatat 

100621 agtcccgttt gtatcaacca aatccagtag actgcgataa atattagcat gatctattgc 

100681 attttgttta gagggtagat gggataccca cgaggcatag tcaacatcag agctcccgca 

100741 caataaaaga tgtttgacgg gtttgatgtg atttggcgtc tgagtattgg gaaagcgata 

100801 gaactcagac gctccaaatt ctagcgcttc ctcgcgtttc ttttccgaac ttgacaacgc 

100861 caccacgtta tatctcagtg ccctagcaag cttgatcgcg agatgaccaa gcccccctat 

100921 tcccatgaca ccaacacggt cggaaggtcg gattccattg ttggtcaagc aagcccagac 

100981 agaagcgccg gcgcatagca gtggtgctgc atgggccgaa tcgtatcctt ctggaatggg 

101041 atagacggca tcggcatccc agacggtgcc gtggctgaat gatccaacat cgaagtcatg 

101101 ttggccgtat gctcgttttt ctctgcagtg atggtcgcag ccttgcaaga ttcgtatgca 

101161 ttaggttgaa tagaacagag tactgatggc ggagtggaag ggaaccgaca taccacttat 

101221 acagttgtcg catctcccac aaacacgacg gataaagcca aatccaaccc tgtccccaac 

101281 ttttacggaa cttacgctgg gaccaacgtg ctttacaaca cctacccctt cgtgtccaag 

101341 agcttggtcg caatgaagaa agtgctcgtc tgttccgcac aggccagaat gggtggtctc 

101401 gatgaagaca tcattatctc cgagtcgccg agtagtcaat ttctccacaa cctttccatc 

101461 tgacgacccg caaaatactt tgaatgtata ttccattgtg gtcttggggt tttcgttgtg 

101521 tcaaaaatga aaggcttact gctgttgcgt gtataggcgt taaaatagaa agttgacacg 

101581 cagagagaat gcccggggct tcaaaggcga cccgaagcca acagatcggt agcagtggca 

101641 gcttagccct tacttgctcc atagctgttg atacaggtag gtctgcttct ggagaactac 

101701 ggtacgagct aatattttga gacccatgag aatgcgagat tctgtaaagg cgtccggcct 

101761 attctggttg agtgcatatg ccagtgtcct cgttgagatc tagataaaag agttgcgaga 

101821 gcgcattttc ttcttcggga atcatacgag gaggtacata cattacagta caacatgact 

101881 gcaaaatgta tggtgataaa gaaggctgaa gaagatcata cggaattttt gggtttacac 

101941 catgtcggca gtgacaaggt tagtcccaac agtgtaaagc ctcgatcctg ctagaaccac 

102001 tggcccccca tttctagaac gcatatacct gataatttaa gcttactttg ctgtcatcac 

102061 accattatca cacgttaacc ttcctcatat catcagacaa gcgactgtcc ctgttttgat 

102121 gttaccggac gctgcgctat accatgactg gaattcctca ggagagaaga tcattgcaca 

102181 tctatactgt atttggagac agaattttga taactgtgtg tgagtgtttt gggacattga 

102241 tagtgacgaa taagatagta ccaaggaagt gaggggcccg cagatactga agaagttctt 

102301 attacgtcag tggattcctg cgcaaggagg aagagaacag caacctgaat cattaatcct 

102361 gtaagctgca ttatgtaaaa gaatgtaaga gttctccagg tgtgtcgcac agaatacctg 

102421 caccaaactt ggcggagtaa atggtgtgga ctgcataacg cagataaagc attggcctgg 

102481 gtgtacagaa tgaaaggcca ctgaaaaagg gaaatataag ttaaaacaac aaggcatgat 

102541 acttactgtg tagctcaagt gtctcgattg atattctcgt caacatacca tgaatcaaga 

102601 cagcatgctt catgacttag tacgaaagaa tctaccttcg atatcagtcg acgggatact 

102661 gctccacgcg gcattgttga cagttaatct gattttgaaa ctgggagggt ggcccggtct 

102721 gttgaagttc atgctaagaa ctttcgccat cagttcagta ttggcaccca tcggcctgat 

102781 aggctttctt ttcctggtgt cgtttacccc gcgtaacatt aaatctttat caaaggacaa 

102841 gtccgttata gggaagcccc ttctcttccc catcacgctt gaccacactc gcctgtcgcc 

102901 gataaaaaac aatttcacct tcaatgtttt gtttgttgga atacctgtag gaatcagttg 



102961 tcgattcggg cgactgttat ctattgatgc caagcataca gatgaagagg agtgtactga 

103021 gagatcttct cttctgaggt tgttacaaac ctacttttcc agctggttca gctttgactc 

103081 agctcggtac ctccaccgtg gtgatgatac cctcagctta gagaataagc taaataaatt 

103141 tctcagagaa caggtaggcc atctgagcat agggtagtgt gaattaggct gacatggcat 

103201 taagaacgag aaccctgcca aatggccgta cgcgtatatg ctgagcgtac cgcggttcct 

103261 gtggtgggaa cggagcgtgg tgacatggtg gtatctatat tcggaaagca aggaattgga 

103321 cgcggtgatt atggagatca acaactcgtt tgatgagaag agaaacgtat tattcaaggt 

103381 gcggcgtacc agaatatata ctgaaagccc agaaaaaggt tttgagcaac tcttggactg 

103441 caaagaggaa catttagatg aggacaaaag ggtattttcg ctgatcccac aacatgggaa 

103501 atatgcctac aaagcgacct ggaaaaaaga gattttctcc agtccatttg aaaaagttgg 

103561 agaaacagtc tctagcacgt ttctggaccc tgtagtgccc tcatcatggt caggaaatcg 

103621 atctctctca aatacgacga ccttcgatcc ttctggtgcc cctagaatga tagcaaggct 

103681 ttggtgcaag gtaccaccca tcgaccccgg caaggcatcc tctttccaaa tatttagtat 

103741 cctcttgata tggactgtga atgttacttt ggcaacaccg agaattctat tccaagctat 

103801 tcggcttcac gtcatgaatc tcatgaggat gatggagcac ccggatgtac ggcctggaag 

103861 cgagccaaga cgtccgagta aaggggaaag gaaagatgca ccactcaaga tgctacatag 

103921 cctatttgat ctacatgcta acctacaaag caggatgctc gaacgattct ttcgagaata 

103981 tctaaaacac atagttgcct cttacccagg cgacttggaa gtcacctaca taccttgtaa 

104041 atcagtattc aaaaccacaa tttgcctaag gtctacacag tactccgcag aaggtcaaga 

104101 tcctaggcgg ctgagacttg aggttatgga tcctgccttt tattctagaa tcgtcaattc 

104161 tccgaatgca ggcactgcta tggcgaagga gacaaaacct ggaaggtcac ccgctgatgc 

104221 attatcctct cccgtcattg catccgatat aactcaatcc ctacagctat tggatgtgac 

104281 ggtatcagag aacagaacct acgataaagc gtgttccaga atatcgtcat gggtactatg 

104341 tctccgccgt tggctgactt cctcattcat ggattccttt gtttgcaagg ctctcccgcc 

104401 tcgtacacaa gaggaatacg tctcatgtct gatccaacta tggctggaaa aattgacatg 

104461 ggacttcttg agtcctcacc agatataccg cgtgattagg gctgcgatgt tgcagtgggc 

104521 tatatgctgg atactattct aaacagcatg acaatttgag tgcttaggaa tagagagagg 

104581 tctagagata taacaggatg ggtgtagttg gtctggttct gattttcctt gtaatttaag 

104641 aagctgaaag tttcacgctc cagtttgtct tcgaatacga atattgttac tcaatcatga 

104701 ccaggctaga ctattgaaaa tgccacaaga atcaagctgc aggtcccggg ggtctctgag 

104761 aagaaaagcc ctctgtcgct gaaacaaaca catacacaaa caatatatat atatatatag 

104821 tcacgcgtga gcactatcaa ttccgtgcga aggcactgga agatattttc tttttgtgca 

104881 ggtgatgggc gacggcggaa atcgcctcct acgtacccct ccatgtgacg taaaactctg 

104941 aaatgactag gattgctaaa aaatagttaa gaagagatac atctcttgga atcaccgcta 

105001 taggttgcgt ttccgtcgaa ggttttgcat gatataccaa caagaacaat tgatctccaa 

105061 ccctacaagc acaattgtcg gccaaaaggc aagcccacca ttccaagcaa agggcccagt 

105121 tacatgcata tgaatacccg agcccaacgc aaagactatg ggtacaatca ggttaatcca 

105181 tcccatggcc cttggaaaga cggggtcagg actattgtca gagaaaacag cccaggcagt 

105241 cgtccaccac tgcacccaaa atatgggcca tgggagaagc atactgagcc agaataggtc 

105301 atttatttac tgtgtcagct ccgggccgta gtcgcggaag gtcatagcgg cgagtaaggt 

105361 cgcgggtacc atgaaggcgc aaaatccggc tgcagcactg gcgagctgga ggtccgggat 

105421 agttgggtta aaattgaaaa tatggcggat tttacttgta attgcggctg agtaagggac 

105481 gtacagaggc cccgatacat gaaaaataca ctcgctgccc gagtcccttt taagtaggtt 

105541 cgataatggt tgtaaactct ctttgcatga tctatgcagc acatttgcct tgtgtctatc 

105601 atttctcttt ctgccccaac gtacatgatg taacatatca ttgttcatga acgtctctat 

105661 tacagtattg atacaactca agcgatttct tggccagaac tgcacccctg ctttttcggt 

105721 agatattcta aattgtccga tacatcactc ggtgtatttc attttttgca tatctccatt 

105781 ccccttcaga ctctccctgc accaaaacac ctacttgcat atggtgttac acatacgggg 

105841 cctcagcaga caccactcaa gcgaagaatt tcatgcctcc caaaacatca gaacattttt 

105901 ctcttgagcc ttacaacagg gcagtgagat ctgcatcgtc ctgaaagggg gcaggagaac 

105961 tatctgaccc ctcacctagc ctggcgatta ataccgatct ctgtatcagg caatcgtatc 

106021 ttgtcatatg gtcctcaggt ctcgtgcttt tgcttggcgt tgagattgat gaattctgat 

106081 atcataccga gtggaaatct ttttgaccat tacctcttga ctttattatg cacttggttc 

106141 acgtttgttt ttctataccc ttgcctcatt atttagctca tctggggatc cagggccttt 

106201 ctgcacgacc ccaaagtcta ttcggatccg atgacattca atcctgagag gttcttaaca 

106261 acggagagct accaggcaga gcatgatcca cataatctcg cttttggttt cggccgtaga 

106321 atttgtcccg gtcgtggctt tgctgactct accatatttc ttaccgttgt ccggtcgttg 

106381 caggctttcc ggattgccaa aatttctgaa gatggaagag aaatcgagcc cattgtggag 

106441 tatctgcctg gagtaatcag tcatcctaag ccttttgcaa tctctattac gccgaggagt 

106501 aaagagcatg agtcgtttat tcgttccatt gagatagagc atccgtggga gaagggggac 

106561 ctcgtgttgg tttaaaatat tggtcttaga tacacattta tgggctcaat attcgtatat 

106621 cagcgtctag tccaagggct taggaggtcc atattaagcg tcaatgtatt ggtagttgtg 



106681 gcaagcaagc tcagagtttt caccatattg acttcattat acagattgcc gctaatcggc 

106741 aactgcaaac ctccaaatct ccttcttggg agaatggggt taattcgatt ttcacttcat 

106801 taactcggca tagccaagtg caagtgatca gacatatcaa tactaaatta gttgtagagt 

106861 catgcgccgt gcaagtagac gaaataggta gtagtgcaat cctggagatg gagatgagtc 

106921 atgcgcccat gtcctcctag atgtatcgtc acatttgcgg aagactatgt agatctaaat 

106981 aatgtctcct gcatttacac tttcccactt aagaatttcg gggttgcgag ggggtcaata 

107041 gcagcaaggg acgcaaggaa gaatggcgcc gacttctatc atacacatga gagaagagat 

107101 actgcataag gggtgtacct gggcactaca agccttaaaa tgacattctt acggagagat 

107161 ttctgagagc atcaatattc caatgaggaa aggaacaggc ctgtgtttta caaaatattc 

107221 ctgttccatt tcaagtttaa attattgatc tgtccaatag tattggaaac gtacccccat 

107281 ggctcatctt ctacaaggaa cagccttcat cacaggggca ggctcgggta ggacggcacc 

107341 acgtgaatct tatcgcggag ctgaggcaaa ggaatacagg catcggagcg ggcgtggctc 

107401 gaacattcat tcagaacgga atatccaagc tagcattagt cgacatcaat ctgtcaattc 

107461 ttcaaaaagt ttccgagtcg ctgaaacaga cgttccccaa tgtggaactg ctaacatcag 

107521 ctgttgacgt gactaacgaa gtacatgttg acaatgcggt ccagaaggcg gccactacgt 

107581 tcggtcgtgt ggatattgga gttaactccg ctggagttgg tgatccggga gagcctacag 

107641 atcgattatc cctgaaggat tggcagcgga cagttcacat taaccagacg ggtgtgtggc 

107701 tgtcgcagag ggcactgatt cagcaaatgc tcactcagaa gtaatacatg ataaatagcc 

107761 tacgactgca gactcgtgtt ctgatactct acagatcccg tggtacacga catggccggg 

107821 gtgttattgt taacgtctgc tcctcgctag gcattactgc ctcagctacg ggtatccctt 

107881 ttccagcata tacatcctcg aaacatggta cgtttattca cgcggcccaa ggaatattct 

107941 gttcagggct tataagcatc tacaggcgtg atggggctca cgaaaatggt tcgaagcaac 

108001 tatcaaattc taatacgatt cacctcagct gattaatggc ataggatgcc aagtactacg 

108061 ctcccagtgg gattcgcata aatgcagttt gcccagggta tgttaagctt tgattttgct 

108121 ggatctcgca catctctgaa gcgtatacag atttgtggac actccaatgg tatcgaaagc 

108181 agtggactcg ggagtcttta agaaagaagt tgagagtgca ccagtaggaa gaataggcga 

108241 tgtggaagaa attaccgatt cgatcttatt cctcgcatcg ccgatgagca gctttatcta 

108301 tggagcagga ttagtcgttg acggggggta tggactctag atgctcgtta tactagtcat 

108361 cagggatttg atcagaactt ctccaccaga atcaggcgag aataaaaaat ttctctgtct 

108421 tttaccatgt taccgcgata tttaaactca tgcatgcctt gtagctgcat ctgcttggcg 

108481 gagctatagt atgcaaacaa cttcgcaaac tcaatatttt ttcttcatga tgggcaccat 

108541 acacgagctt aagaccaggg gagagacgac atttacgctt tcatcaacat tcgcgtacct 

108601 tgacttagat ctcatgacca ctgggactct ccaggcctta cgtgtagcct atttgccttg 

108661 ctgatctcca ccacaatgtt gtggagcaat agggccctcc tgtagaagga aactacatga 

108721 actgatgcat tacatcgtaa acggctatgt ctatcactgg aaggaatgac tattcttctt 

108781 ccgacctcct tttatttgtg cagacgacta accggaaatc caaaaatggc caacgaacag 

108841 acgtgctcgc cctccattaa tttaaataca catgaaaaca tgtgtaaggg acgcattcag 

108901 caaagacgtt atccagctac ctgcaatatg cttaagaaac caaggtatga atgaaaaacg 

108961 caatatgatg tagaacctcg ctagtggaac atgtgctagc cttactatgt agggtataga 

109021 tgtgtatgcc tcatcccgtg acctaagacc acctactcta agcatgatgt ccccgctcaa 

109081 gttcttaggt tcctcacacc cagatcgaac tatgcacagt tttatttgcc ttttcaaaca 

109141 tgatatctgc atcgacgacc gataccttca cccaagtgat cattatcgga gcaggtatgt 

109201 caggtctggc catggcctgt cagctcaaaa agcaactttg ctgcgaggac tttgtcatat 

109261 acgaccgagc tcctagcttt ggaggaactt ggtactttaa caagtgtgag tttcagcctg 

109321 tgacaacctg agcagactta tagtcacctt ttcgctcata aatggagtag atctaggttg 

109381 cggagtcgac atccctgcgg cgttctatag cttttccttc gctctatacc cccaattcac 

109441 ctgcttcttc ccgaagcaag aagagatatt gcaatatatc cacggcgttg cagacgagtt 

109501 cagcgtggct ctgaaactgg ttggtcatac agagtgggag ggggcagact ggcaggattc 

109561 tgaacagtgc tgggaagtga gactgaggga gatcccgagt ggtcggaagt tcactcgtcg 

109621 ttgtcgaatt cttatctcgg ctgtgggggg actgacgaat cccaagcatg tcatgctgca 

109681 aggaatagaa agattccaag ggaatatagt gcatacagca ctgtgggatc aggagactgc 

109741 agtcgcaggg aaaaacgtca tcgtcattgg aaatggaggt agtgacccga gtccttgcct 

109801 ttctctgcgc tagatgctga catttagatg ttgatcctag cgtcagccac ccagtttatc 

109861 ccagctattg ccgatgatgc cgcctctata aatcagttta ttagggtatg cctgccaaac 

109921 ttatggggag cattgctttg tatccatgct caggagcatt atagaccccg catcactata 

109981 tgcccggcaa aaacttccgg atcaacggac tctggagtgc tatctttcaa agcataccgg 

110041 tatgcttctg gctatttcgg gcattcatct ttctttatct agagacctct attctcctcg 

110101 gggaactcac atggatgggg aaactgatga ggagatggga gacttataga tcccgacagc 

110161 atgtgaggaa gtgtgcgcca ggtaagttat tgaatattgc ctaatcttcc cttaaggaat 

110221 atgctgactc tctgaaactg cagaagagca gtattggtct ctccttaccc cagaatactc 

110281 tatcgggtgc aaggtaaccg ttattcatgc gtgtaaattc atttttgaca tactggatgc 

110341 taagcatgac ttagcgccgc gtgtttgata atgacggtta cttgaaatgt ctacaccgac 



110401 ccaatgtcga catcacaaat gatccagttg tggcagtgga ggagcagtct ataacaacgc 

110461 aatctgggaa gaggtttccg gcagatttga ttgtatgctt cccaaaatgc aatttcacga 

110521 gaggccgatg ttgacttgga ttatggaagg tgtttgcgac tgggttctct ctctcgcagt 

110581 acgatgtcga ccggcgaggt cgtaatgggc gtacgcgaca agaacattgg aaccgtttcg 

110641 gctgcataca agcatttgag acggtggcca tggcggaatt tcctaacttt ctctacattc 

110701 tcggacccaa ctctggcagg ggccatacat caactgtcta ttcaattgaa aggtaatgat 

110761 gcctgtgcac caactgttac ctcatagacc attgctaatc ttgacactcc agccacgttg 

110821 acctggtaat tcggattatt aaaccaataa tggaacacaa tgcagtttca gttgtgataa 

110881 aacaagcgag tgaagaacgg tttaacacga agttgaagtc tgcacttagg aagaccgtct 

110941 tcacaaatat gtgcagaagt gtgagtttca cttttgctga cggacatctt actccacttt 

111001 attaataagt tttccatagt ggtacataga cccacgaacg gaccagaatt ggtttactta 

111061 cccatggaat tcatttagaa tgtggtacac gactcagtat ggtgggaagg aagactggtt 

111121 atacacgaaa gcagttgcca gcctcgacaa tcagtctcgt ccgtgttcta accgttgtgg 

111181 aagagaagcc ctccctatcc ttattatgat cctctgcatt ctccttgggt gtatatttgc 

111241 gatggaacat ggaatgcgct agcttcatgt acctaatggt gcgtgagtat cagcatgtgc 

111301 gtgtcagact cggctactca gagctgcgaa actcctaaaa tgtatcatta aagtcacgta 

111361 ttaatacatc ggttctgcac tcctgccttg gacaatgctg tcgcgcagaa tggcctcaca 

111421 tcgcacatcc accaatgaca gattcctatt aaatttcttc accagctaga ccgtcatccc 

111481 acaacggccg tgcgtcgagt gacaggtgcc ccgtggatgc gctaatgcac ctcagcgggt 

111541 catcagacgg cacaacgcta tgcatgggcg tttaagtgac accaaagtag ccatccaaca 

111601 actagggcga tgatattcct ctaccactgt ccatattaga ctttcccacc gtcataatgg 

111661 gctttcctca gcccgcccga agcccaaact aatgcaaggc gttccaccgt aactgaggag 

111721 acgtttctct cctcacgagt cgaatagctt ttaggggata tccctgctcg tgtagccaaa 

111781 gagggggggg gaattaacac tactagtata taagagtaga accaacatat aaggacaccc 

111841 ccctctcccc gctcagccct cctccactcc tccacaaaat tgcacctata catacacacc 

111901 aacacaatgc ccaccatcct cctcaccctc ctcgccctcc tgaccctccg cctcctctac 

111961 gaatacaaac gcgaccgcaa cctccctcca ggcccccgcc ggctccccct catcggcaac 

112021 ctgcaccaag cccctcagac cctcccttgg cgcgtcttcg accaatggag caaaacctat 

112081 ggccctatca tgtccgccca attcgggcgc cagaccctga ttttaataac ctcacccacc 

112141 atcgcgcgcg atctcctcga caaacgcggc agcatctacg ccgaccgccc tgacctcgtc 

112201 atggccaaca acatcaccaa gggcctgcac atgctcatcc gtcaatacga cgattggtta 
112261 cgactccacc agcgtctcga tgctccgctg cttagtcccc gcgcgtcgaa tacctaccat 
112321 ccgattcaag atcttgagag caagcagctc atgttcgacc tgctgcggtc taatgacttc 

112381 gacgcgcatt tcgagagata tagtgggagt ttgatgttcg cgttggcgta cgggttccga 
112441 cttctgtcgc cgaaggggca agagctgcgc gacatgcgga ccattcaggg taattttacc 
112501 tacgcggcga gggtggggac gtggattgtg gatgctattc cggtgttgaa ttatctgccg 
112561 gcggtggtgg cgccgtggaa gaggcttgcg gagaagttat ttaaactaga ggcgagtgtg 
112621 catacgagac atctggagaa ggggttaaat agtgagccgt ggaattggtc gaaggagttt 
112681 gcggcgtcga agcatgcgga ggggatgccg cggttggatt tggcgtataa tttggggata 
112741 ttggtggatg cggggtttga gacgacgtgg acggtgatga agatctttgt gctggcgatg 
112801 cggtcggatc cgcggtttgt ggcggtcgcg cgaaaggagt tggatgaggt ggtgggtgag 
112861 gataggatgc cgacctttga ggatcaggag aagctggtgt atattcaggc ggttgtggat 
112921 gagacgttgc ggtggaggtc gatggctccg ggtggcatac cgcatgctgc gaggaaggag 
112981 gatacttata tgggatatcg gattcccaag ggagcgacgg tgattccgtt gttttggtcg 
113041 atgtgcctga ctgatgagcc gtgggatgat ccgctggagt tcaggccgga gaggtggttc 
113101 gaggcgacgg agaaggagga aggtcggttc cgcaatttct ttgggtatgg gcggcggata 
113161 tgcaccgggc ggcatattgc taggaattca ttattcttgt tgatggccag aattctctgg 
113221 gcgtttgata tccaggcgcc gttgggggat gatgggaagc cggtcccggt ggatgatatg 
113281 gcgtttgatt cagcttttgt gtcgacaccg gaaccgtttg aggcgctgtt cgtgccgcgg 
113341 agtgagaaga ccaaagagat tgtggagagg gagtggaatg agatggagaa ggatatggcg 
113401 gtgttgatgg gccaggtgcg agatagccag cgggcgcttg ggctggatgt tagggcgtga 
113461 ggagtaagtg ggattaagcg aaggaaaata acatagtatt aggtatatgg cattaagtat 
113521 gtaagatgaa gctatacgtc tatcaaacaa atgtagtgta tcggttaccg atcgagcaat 
113581 ccgattcgtt tggccaccca ggcaaatata acagtcaatc ccaggatgaa acctagaatg 
113641 ctaaaccacc atgtgagtga attgtcccca tcatgtgtct tccctggcac tctgacattc 
113701 attccaaata gacccgtgat gaactgcatc ggcacgagaa tagaagcaag aacagttagc 
113761 ttgctaagtg ccgatgcaat accattgcgc atgcgtccgg agtcgaaatg aacctgagtg 
113821 aggtacttgg actgcaatcg cgagagcatc tgctcggcct gctcgaggtt ggcaatcatg 
113881 gtgacgacgt gatcgcagat atcgctaaga tacagggcaa cttgcgacga cgaggtgtcg 
113941 gagatatcgc agtgccgcgc aaaggagcgg atgacatcga tcttgtcgtg cagcggctga 
114001 cgaatattca tgacctcctt gcgcagtttg aagatgcgct gcaaggcgag tcccatgtcg 
114061 tcaggtcgag tgatggagac agagtcttct acggttacaa caccattttc aacacgactg 



114121 atgaacgggg caaaaccgtc aacgatatca tcgctgccag gcatcagtaa acacagccgt 

114181 aacagagcta gagtagtggt ggggcttaca tcagggcata gcaaatccaa tcgctggtca 

114241 acagcagatg gctgctgtgc tccttgatac gctgtcgcac atggctagtg tgcagactgg 

114301 ggtcgaagct gaagctgaga acgccgcctc gaaacaccac ggcgtacagg ttgtgcgaga 

114361 cgatacgcac tccagtgtcc gtctcgagcc gacgcggcgg tcgcagagag agaaagtagt 

114421 actgaccgaa aagttcgatc ttctctctgg tttcacgggt cttgatatcc tccgtagtca 

114481 ggggatggag gttgaagaat cgcgcaaggg cttcaatgtc ttcttccgat ggggccgtaa 

114541 tatcaagcca ccataggcca ctgtgagccc ctgtttccag gaggagctca aagggcctat 

114601 acacggaaca taagtcctcg atactggagg cctcgatcga ggtgttcagt tgggaggtga 

114661 agaagttgag ccggctggcc actgcaaccc ggggcttctc cagacggggg tcgggggtcg 

114721 aatcagagga catgattgca ggcagcgggg cgggctccat tccctaggcg gggaaggcca 

114781 gattgaagat tatgggcaca gctgggatgg aggagacatc aacggggatc ccaatgggga 

114841 agcgcagcgc atcctgacga tatggtgtgc gatggatgaa gcctgacgat gcgatgaaag 

114901 acagggagag agaaaagtcg agccggagcc gtggcggtgc acctgccatc ccgcctgtgg 

114961 gccaatcaga gcaggatcgt accatccacg tgttgccgcc cggtcagtct ttctagaaag 

115021 tgatgtattg cgagttgtga gacaacacca ttgacacttc accagatatc tctgatctat 

115081 caactctgca agaatcagtg gcatacatac agtgatttat atatagtagc cgacgctcaa 

115141 caggtccttc cgagtcctcg ctgtcggatg ctgaccaatc agagcggcgg atcgccatct 

115201 cctccgcatc gtcgcatccg gcaaccataa gcagcacgtg acgggcagca ggacagagag 

115261 atcactagaa cgaattgatc atgttaagtg agtcggctag ttgggtgaag caaaaacacc 

115321 gcttcctagt agtactactc tcgacccatc tgccattcaa gtttccatat tcggagcata 

115381 atacatgcaa gatatttgcc catttaccta gcccgtacat gtagacgtca ttgccagaac 

115441 gacgctagtt atcaatcagc taattcatat cattgtgact ccatgtccat gccccttccc 

115501 aataaatata taacaccaaa cctgtatgcc aaaatcagat gtatcagttt aggatttctt 

115561 tgtggtctgg ccgcatgccc cacccagcca ataccgcgtc agcaactgca cagcatctgt 

115621 gttcttccgc gtaaaatcgt acaatccgga agcacggatg ttgagcatca ctccatcgaa 

115681 atcggcctct aaaacacggg catggcgtgg acccatgaga gagagcacca gcatctacca 

115741 aagtaagatt agtagaagcc tttgatggat ttgtcggagg aagactgtac cggagcaaca 

115801 atatgcggtc gcagtgtgtt gcttcttaga cgggccttca tgatgtctgt aattgtcatg 

115861 atttctccac ggagtagttt atcatcgcca atggcatcat tttccacaat ggccttgatg 

115921 tggggaagat tttcgcctgg gtaatcagtc actgatacag cataccagtt aaactttccg 

115981 cattccagta attggccgaa cttgacatcc ccaattctgg gatgatgtag tcagaaagcc 

116041 agctagcata catgaaaatt cgagagagaa gccacatact ctcggtacac gccgtacgga 

116101 gggttttggg gatactcgca cttggcaaac cctctgcaat catggaagtg gtcaaagagg 

116161 aagagctcca acgcacgggt atcccacata atgttatccg gggtcagctc aatctgaaga 

116221 aattctgcct cctcaaccgg aagaggcagc tcctcttcgt acggcctgaa gaaccacggc 

116281 tggtctttgg taccctcgag ggacaaaggg aaatggcgga tactggccgc cttcaggtcg 

116341 ctgggttcga catgcatatt aggtgtttgc ttgatgtgct cctcaacctt agccaaagcg 

116401 tgacgacgag cctgtcgctc aagaagaagc tgttgatcag acatggtgac tgagttgtga 

116461 gaggtgatga tgaactggga aggcaggggg gagagattga atattggcga cggctaccgc 

116521 ggcctgaggc aaacatctta ttgtcaaggc ctctttggtt tacggcgttt gggcaccttg 

116581 cttagaacct ctcagtgaca gcagctacgt gttaataaat atagatttag tctactatag 

116641 caacaagagg aagtgcactt tatcactgat acatgtggct caatgtcaat cactgcccct 

116701 tcatggcggc aattgatcct gccttctcgt tcacccctct ctctgttctc aacccattgc 

116761 tctgttaatg cggcgacaag gatattcaca tagtaaatca gacccgcttc ttcacaagat 

116821 gggtatgtgt gaatatacgc tcgataatct gagcgcactg gataaatttc ctgtaacaca 

116881 ttcccgtgga agttccagtc actgccatag tggaattcgg attatacctg ctcatagtga 

116941 aggcctccca cgcgcctaac cctgtataaa ggggcttctc ctcctgccat tcaagttaca 

117001 gctgacatca atcattctat attttcatta tttatataac ttattttaaa gtctacactc 

117061 tacaaatcga tataacatag cgtctccaaa agcaatttgc ttacaaagta tgggagggtg 

117121 ttgtttcatg cacgcgacga aaactacaaa cttccctctt caggaaaacc tccagtatat 

117181 cgattcttac agcagccctc cggacctgtc tcctgtgtaa cgttatcctg accgcagttt 

117241 ataaacaacc gggcagacag cgaaagcgtc atggtccgat gcggaccgac cgctcgcctg 

117301 gagctggccg cccaattcgg ccactgcacc acttgcccca ccaatttcgg tgttggatcg 

117361 cccatctagg gaggatgatg tgaaggcgtc gactatggat aatatttata cggactgatg 

117421 gctgcctctt aagcggatca tccagaaagg acgagcattg gtcgaagtta agcagctttg 

117481 aggaaggaaa gcttgaatac ctacccagta agcatgagga tgcgcagatg gtagccgagg 

117541 ttaaattcga tcacaccgga caagtattct ggcagaatgg gggacagatc catcaggtta 

117601 tgcaataccc ccagttactc cggatcattc cacactcttg tactaataca ctccaggctt 

117661 tatttatctg ctcaaacaaa gcatacgcac agacataccg cgacaccacc gatcaaaaga 

117721 tggtacccct ttagccacgt ccgtttcatc gtcagcactg gcagtaatac tttagtcaag 

117781 gtctttgacc tacacatgcc gaagcacaaa ataaataact atatcaatgc atcgtttaca 



117841 agattttcac gatgcaatcc gaatcttcca catgatctcc caacctcacc atcaactccg 

117901 ctaatgatga tattaccacg tgtcacgaga tcctcgtccc attgaccaaa tatgcagagt 

117961 agaatgatgg ctatactttc agcaaattta ccgtacactg gccccaggac cgcgtttgtt 

118021 tctcaaaatg cgttaatttg gaccctggag cacactgggt cgaaggcaat tgaaactatt 

118081 ggtgatgttc tgttaaaaat caccttacgt gcctatgcat acgccagccg agcgattggc 

118141 aagcaaacgg tgggagacca acatcggatc cagaatggca gtggtttggg agtgttattg 

118201 gaagcataaa gcgatcgttg tcttgttcac gcatcttatc aaaccgaaga tggttgccta 

118261 aatggaagct gtgagaatca atgggattac acatcaaggc actggaaacg cttaacaacg 

118321 gtaccatcca ccatcaagaa gaggttttct ctgtcaaatc tcttataaag tacctcgttc 

118381 tcttggcatt gttgctccca cttgcacatt ctcttttccc cgtaaaaaca ccacttctta 

118441 tgatgattct tgttgccaag tccactaccc tgatactgtt atggtaatgg ggaacattca 

118501 gtgcgcctct gtatcctcag ttatcacttc tacttgatat gtggtttcat tagcagagag 

118561 gatttgtgtt ttgtatccgg aacaggacat ttccgtacac tgtaccatca ttatgccgca 

118621 ttcaatcctt agcattccat cttccacctt gttgaacaat caaacactga cacatctctt 

118681 ccacactccc gggacaatat atctcccagt cccagtagaa cagcggggtg aaccccacca 

118741 aacaaactaa tcgtaggcca gaccctctag ctatagtagc cttcggaatg gtttccgagc 

118801 gggggcaatg tcgccggaag ggatcccgta tccagcctca aagctattca aaccccaccg 

118861 gcaaaacaaa ttatgtaact gggtaagcgc ttctccggcc ctcccacgcc accccaatta 

118921 cccagcacaa tgccaggcat aattcctacc aaaagttgct agcaccccaa agccacaacc 

118981 gcattactat gcaaccgtca taatgcacca tactatgcat ccagatccac cgccccggtc 

119041 agggactgca tttaccccag accagaccag acccaccgtt gcatcccaac ttgactcacc 

119101 gatataaaaa gcacccttct caccggcaca caaccatcac cctccaagca accacaacca 

119161 aaagtaatta ctactatgct caacgtcatc atcgtcggcg ccggcctcat cggcccccgc 

119221 cacgcacaat ctgttatcac caacccctcc accaaccttc ttgccttggt cgacccctcc 

119281 ccaagtgcgt catccgtggc taaaaccctc aacacgctct acttcccctc attatcagcc 

119341 ctactgtctt ctgtaccaaa catcccacac ccagacgccg ccataatctg cacccccaac 

119401 cacacgcacg ttcccgtggc tctcgaactc atcagccaca acatccacat cttactcgag 

119461 aaacccatta gcgataccat caccactgcg cttcccttgc tccaagcgca gcaaaagtat 

119521 ccagatgtga agatcctcat cggccaccat cggagattca acccatacat caccaaaacc 

119581 aaggaaatcc tcgagtccgg gtcactgggt ccaattattg ctctatccgg actatggact 

119641 ctatacaagc cagcatcata cttcaccggt gctaccgagt ggagaaggga taaggctaaa 

119701 ggtggcgtat tgagtattaa cctcattcat gatgtggatc tactccacta cctattcggt 

119761 cctattacgc gcgtttacgc agagaaaacc ctccctcagc gcgggggtcc agatgggaat 

119821 gagaatcaca ccgcggaaga aggcgcggcg attacattcc ggtttgctag cggggtggtg 

119881 ggcacgttcg tggtgagtga ttgtgcgccg tcgccgtgga atttcgaggc cgggacgggt 

119941 gagaatccga ttattcctaa agtgggtggt gtaggaggag ggctctatag ggttttggga 

120001 agtcgggggt cgttgagcgt gccggatttg aagaggtgga gttatgatgg ggttgagggg 

120061 gagaaggggt ggaatcagag attacaagtg gaggaatttg aggtcgatga gggagtgccg 

120121 tttgatttgc aattggagca ttttgtgcgt gtgttagaga ggggagagac gccgaggtgt 

120181 gatgctgtcg agggattgag ggcgttggtg gtggttgatg ctgtgaagag ggcgatggag 

120241 acggaggagg tggtgaaggt ggagagtgtt gaggagattt tggcgaggaa tgaatgagta 

120301 gttgatggga tgaagtgagg cggatattag agattttgct agtatgatat cgaataaaaa 

120361 tattataaac ctctacttat ttgagacaga ctgcacaaag ctcgaatgca ccataatgga 

120421 ctgatcaacc gaacctcctc cctcccgttc caccgactct atggagtgtg gcttccttag 

120481 actcgcactc gtcgtaacat gattcgtcag gccgatctcc tcgtcatcct gtccatcaaa 

120541 cgtatacttc gtcgacagaa cactatgcac gcgtcgacca gcggaaggcg tgccattccc 

120601 gttggcattg ccgctacgag accgccacgg cgtactcggg ttagttggcc gactgcgctt 

120661 gctatcctgg ctcgaggaga gcttgatatg cgctaaccgg accaacgggg cgagatgagg 

120721 caggcaggcg cagacgaggg agaggcccgg ttcgatggtg gaccagatga agacgggccc 

120781 gagggtccaa gtgacgtcgg tgcgtttagt gaattgatcg acggagtaga tgcggacgat 

120841 actggcgacg caggcactac aagccaatat aagtcctggt gcaattaggg aacagaggaa 

120901 aggggtccta caaactgccc acggccaaaa taccgctaat agccaccttc ttgcgacggt 

120961 tcatctgcag tttggcgatc tgggggaatg gaatcaacaa aaccacgatg tcgttgagca 

121021 tgttgatgat ccccagcgct aagaagaagg tatcggtgtc aatgcaggtg ccgcttgcac 

121081 cgccaaactg gttccagaag tatgaggcgg gtttacatgc tgtggccatt gtgacgcaga 

121141 tgatgatcgg gtaggagagg gtgaggaaga agccgaggta gatgccgatg cgcagggaga 

121201 ggtctgcatg ggaggccatg aagacgcgac ggtagaagaa aaggatcgag actttggcgg 

121261 cgcagcatac ggcggaatat accatgacgt agatgtagag gatctgagat gtcagtgcgc 

121321 ggacaacagt gctatgtggg gtgctgctga ccttgaatag tgttaccagc ttggtgacgc 

121381 tgatagccca gacatgtttg ccggcaccgt agaagcctcc tggtatccgt cagtctccaa 

121441 tcctaacaaa tagaccccaa tagcctgata gaccttaccg gagatactca agccggatgt 

121501 cgtgcagata aatacctatt tataaagcac aattagcgga cacccttcga aacaaattcc 



121561 atgtgaagac aagtccaccg caccaatgca agaataatga cccaatcatc ggccataaga 

121621 gaattgcgca agacacgccg ggcaacaaag cgtagcacaa ccgaaataac agccaagcac 

121681 agaataaaaa tgactacagc attatcctgc tgcaccctac tatccgtcaa gtcaatatca 

121741 tccgggggct cgccgaatat agcgacaatg gcaggatcca cggccatagt ggtccgccga 

121801 ccgcaacaac aacagcgaca gcagcgacaa gaaaatctca ccagcagatt gattaaaaca 

121861 gaaaagggtg atataaaggc gaccaggaat ggcccctacg gacggctgag catcctggct 

121921 gcgggaatgg cgtttcctaa gatcggccga caagcttggg ggccagacgg gcaatccaat 

121981 caatgctggt acatatatga ccccaatttc aagtggatat taagacaagc caccgtctct 

122041 ttagctcaac gctgacagca gccacacgca ccgataatta gtggccggcg tgggtctccg 

122101 gaggcacacg accgggggcg gattccttcg cttacactta gcctcaaccg ataccctaaa 

122161 caaacaggag atgcagcagt gtcaagccgg aggatggctg cctgttactg aacaaggata 

122221 atgtgcaata ttggcatgga cagttatagc cgtgatcatg acaacttggc ggaagtagca 

122281 gtgagcatgg ctaatactgt ttatcccaca ctactagtac accatgtaaa tcgaaccatt 

122341 ccccttaggc tgtatgccta gataaaccag cggggaaagt ccagtccgaa ctgcttgtct 

122401 gcagcgacag ataaaaagcc cagggaagag cagatcagcc ttgcccgctg ccaatccctc 

122461 aatccatttt accccaacag tcaacatggg ttccatcgcc cccgacgtgc caacatttac 

122521 tccgctcgaa ctcatcggcc cccgaggcta catgcgatac gtgttcacct tccccttagg 

122581 cgaagactac gtgctggagg atgtgtccag agccctgaaa gccggctaca acgccgcagc 

122641 ccagcgcctc cccattctag cctgcgaggc catccccgac ccgaccgccc gacaggatag 

122701 tgtaatgaag ctgcagatgc ttggaccaaa tgcagaagag cctatattag taaaagacct 

122761 ccgaacgcag gacacactcc caagctatgc cgagctcaaa acccaacact tcccggtggc 

122821 cgccttcaac gccgacacat tgtgccgccg tgacatctgg ccgacagatc taggcgaaca 

122881 tctacccatc tctctcgtcc aagccaactt catccgcggt ggattgatcc tcacatggtg 

122941 catcctgcac atggccggcg acggcaactc cttccagaca tggatgcgag tttgggcaga 

123001 ggaatgccgt cgcgcacaag gcgtgaaaat tactgaccct gtcgtactcc caccagctat 

123061 tttcgcggac cgtgagaaag tcatgcgctc ttcaggccgt aatccaggcc gaatagaaga 

123121 tcaccccgaa tacgtcgtcc tcccgttcaa gcctcccgaa ggcctgccgc cgaagatgct 

123181 ggccaagaac caccgcgctc aagtcttcta tttctctccg gaggctctgg ccgcgctgaa 

123241 gaaggaagca tcccccaaga atgcctctga gccgagggac gtgccctgga tttcgaccaa 

123301 tgatgccttg tccgcgttgc tgtggaggac agtaatggcc gtccagtggc ctctcgaaac 

123361 cctagaaagc gtcaaccagg atcaggaatc atttttcaac atcgccctga acggccgctt 

123421 acgaactgat ccccccgtcc acccgaacac acttggctgc ttcctcgaat acatcggggt 

123481 gtcgatgccc gtgcacaaga tgctcacttc cgcccagttg gccaatctag ctgtcttgat 

123541 ccgccaggca ctgcaccaag tgagcaacca gcacacggac gatgtgattt ccctgatcga 

123601 gcagctcgat gatattagga agatattccc ggcctcgctg gttgacctgc ctgggttcaa 

123661 ttgcatactg acttcgtggg ttgactttgc tttgtataat ttggattggg gaacgttgct 

123721 gggaggacat atcgagtcgg tgcgggcgcc gcatgtcggg gttataaatg gtttgcaggt 

123781 ggtgttgccc aatccgcggg agggcgggat ggaggttctg gttggggtgg cggatggatg 

123841 tttggagaag ctgctcaatg atccgctgtg gatgaagtat gcagtggcgc ggtgatgtgg 

123901 cctggctatc tcttccgaga tatgtatcat ttttattatg actcgttttg gtagcgctac 

123961 gctagggatg tcaacgcagg ttgttatgta gaagcgttag tacattcatc ggtgcatgct 

124021 gacatatatt catcgtctat ctggcggtag agagtaatat ctggattata tgaagtcgga 

124081 tattttaaca ttatatttcc aagcgaacag atccaacatc tgtctgtttt caaagagaag 

124141 tatgtacagg ttggagcata aatcactctt gacggcgctt acttggaacg acgctcccag 

124201 acatagggtg accagcctgg atcgcagttg aagtagctgc agggtggtca tcaggaagtg 

124261 taagattcgg gtgattcatc aaggagtcgt aagtattccc caaaatctct ttcaaagacc 

124321 tatgcttatc caactggtcc aactgagcct ttgcatcacg agctcgtttg ttccaggcgt 

124381 gcaactggga aacgagcatc atcgcgtccg ttttccatat tcgcagcatt tgcataccga 

124441 caggtccgac gactgtttta gtgaaaaaga aatcgtcaag gtagcgatca aagtgctcca 

124501 tatcatttcc acccatgtga acctttcttg ttcttacggc ggggtcttcg atgaatgtgt 

124561 ctaggctagc acaggtatag cagcggcaga ggatacatga cctgcccttc cagtctggtg 

124621 gcacgagagg agcaggaagt acatagccag ttatgtattt gttgagaatg tgtttaacga 

124681 atcgtcgttc actatttgtg gaagggcctc cagacgtaat gacattctcg caaacaccgt 

124741 ttgccactgg aagcaggaag tcggtaaaaa catcaggcat atctcttgct tggacgtcca 

124801 gagtgtattc cgtgagggca tgaaataatg ctgtgttgtc gacgtccata acatctccta 

124861 acttaatcaa cccgactacc gttttgggat cgatgcggat actcgtggta tgtttcgtgg 

124921 ggcggcagtt cgcagcaaat tcaagaccag agttagaggt atgcttgtat tgaacccgaa 

124981 agccgcggaa aaattttggt agaagattct tgagcattgc atcaaccgca tcttggtcga 

125041 acttcccacg ccttgagtag acatagaccg agacaaagaa tttcccaaga caagctgtgc 

125101 gctcaatctt agtctccagg aaaggtatga ttctgttttt gttagcgtaa tgactcagca 

125161 atcgacaggc ctccgtctac ctttcgaata agccctcctg gggcaacggc gtcagcacat 

125221 ccactagatg gcggccatct ggttctgtgt cgttcgaata gcaggacaga taattgctga 



125281 taatagcgct ttcccaattc aacatttcca ccgatggcgc tcggccccgt tcctggcaga 

125341 ttgcacgata ttcccggaac agtttagaaa tacagctgac tctctctggc aatgaggggc 

125401 catgcggttt gtcttgaaaa acacgttcca gcctaggcaa taatatcaga acgtggacaa 

125461 tacacgtggg ataagtctac atacgccaag cgaatagctt caagtccatg tttcgccagg 

125521 actttcgcag tttgagtaat ctcggaagaa gacatggtgc cttcaaggga cacaagagtt 

125581 tggcagaata gaatcacgtc atcaacatca atggctatct gcataacctt ggctctcatt 

125641 tcagacgacg aagagtgttt tttcaggatc gtttggcagg cccggaccag attccgcttc 

125701 gtcaaagcat tatcgggctg ttctgagaat agctggtgac actcatctag gaggtctgca 

125761 atgttatccg cggagtatat tagccggtgg agtatgaacg aggagcgaaa aacatttggg 

125821 acaattatgg caccctatga gttgtgagga cgaagataca ccaaagcaga gcaggacgtg 

125881 tagaagtaat cactagcggc atgaagactt actgtcgctc ggtaaacacg ggttgcctta 

125941 gccgcttcgt tgcccgtaaa gccttggtat ttttcatcat ccggaagtcg attcgagaag 

126001 aaatcctctt gaatcagcaa tttcccgtgg aatggcaaac gctttccgac ttcggaccct 

126061 gacgcgtcca ccactgataa gaacttctga ttgcctccga gcacatcctc tagcgcattg 

126121 tgagctttcc aataactgtt cccgtaccgt ccatcgacgc cacccattct ggtcttctca 

126181 atattggcta ggtataagtc gaaatcaagt ccctgacatg ccttttggag ttctccaaaa 

126241 cgaacttgat ctactccttt cagtcgatca aagctaagtt ccacaacagt atactggtgc 

126301 tctaggacat agacaagtgc tggtgggcag tcaccattga tatgatgatc ccaagctgag 

126361 tgatcagatt gcccatttcc actgcaatcg gagacccagg attcaagtag atgagttagc 

126421 ttgtctgttt gaacaccgga aataccaagc ccaactaatg ggcgatgaat caaattgtac 

126481 accagagcca ctctatatcc agaggtgacg ggtctcacct catggataac atccgcatac 

126541 caagcagccc acgagaagct aaattcagag gatgatgcag ttgaccatgc gatcttccaa 

126601 ccacggtggg aagcaacgag tgtcccgcct tcgtgcttcg aaggaagaca taccaccagc 

126661 gtagcaaaca tgccatcggc cttctcagtg tcctggtgtg gaaggaaaaa tgcgccttcc 

126721 tcgtagatca ggagcttata tagctcggcc tgaacctcat ccgattgggc agtcagaccc 

126781 agccccgtca cggcgtcgtc caccacgtca tccatctgct gttgccagag cggattttga 

126841 agcgaaaact ggccagggtc gagttgccat gttttgcgaa cggattcgtc caccagagtt 

126901 tcggaacctt tgccgtaggg gctgcgatcg caacactggg tgatggcttt tgcatcgtct 

126961 gctgagatgg gaagtcgaat gttgcctagg cctgcgatat tcaggccagg tatggctaga 

127021 cccagcccaa ggtcttgtgt taagaaggac ccagaggatt cgatggtctc cagaacggca 

127081 cacaggcctc tcttgagttc aatgtgacca tcattgtcgt ttacatctgc cactataggt 

127141 ctcattctgg ctccttcaat tgcaagagtc tcggggtcag tttgcgagat ggaggatgtg 

127201 atgtcaaagt tggaattgca gttgagagac gcttatccgg ccgcgagtcg cgtcgcgtta 

127261 tttgctgcga atagcggcca tcgggcaaat acaccatcac caattaatcc ctgtccatat 

127321 cccatttcct ttgtctgttt tcacgactaa atcgatagtg ggatcccacg tgatagtatc 

127381 cctgccggca ggctggacat tcatctactg caaagtggaa cgcgacgatg aatttcatcc 

127441 accaacagcg aacccaacca ccactcatcc agagcctgtc agatgcggtt ccatgatact 

127501 taatacccct ttgactcgga caacatctcc ttcatgtgtg tggaggcgac tccgggaaga 

127561 gaattccggg tgcttagtga cccctatacc cccttaagcc gtcatgttct atatatagct 

127621 tcagatcatc cacgccatat ctgaccccga ctcgaggata ctattctgtc tgtcgatcga 

127681 cggatcaata accgcggccc ggccagatgt agatgctgta ctccgtacaa atcgactatc 

127741 taagatcgag aaagggagca tccacgtctg accgactcac cacagacgca ttttattggc 

127801 tcgtgctctg cgcccgccat taaagcgtaa agcttagtag cgacatagca aatataccag 

127861 catgcaacat gcagcaggaa agcctgcagc agacgacatg agaaataagg ctcttcacgc 

127921 aatttccctg ggacaagcgt tccatgggtc gagtccgggt cgcgcagtct cgggtttccc 

127981 ccgccagtgg tgactcctcc cgtcccaggc acgcaacacc ccctgaacgc cctactgata 

128041 gactagtaga tgcccctgca ggagattagc cgtcgtccac tggctatcgc tatctatgat 

128101 gcgaaggaga actagtcaat taagtagttt cgtccctcgg atgattagcc aaagagagga 

128161 catcaattgt caatcccgcg cagatgacag ccgacatggt cggggttttc cgactcttcc 

128221 attgggacgg atggggaggt aagaagtcta tttatttgtt tgtgtgcgag gttcatcgac 

128281 ccagaataca ttccttggca gctttccagt ccattactct cccaatcggc tagaattcct 

128341 tgattggcgc tcggtcgcaa gcagtgggga tgaaggcaaa tcggaattcc ctgatggcgt 

128401 gtggacggga caggcgttgg aatccattgc gagtctgtcc tactgtgttg caagctaaca 

128461 gttgcccgac gagtcatgat ttctccccgg cagcgctcgt cgtttcctaa ttaattccat 

128521 ccgtccgact gagggaccca ttttttgttt ggtttagtgc gtcagacatg caattgcggg 

128581 gactccctct tagggttccc ggcgattttc ggcaatttca ccctccccta ctattttcac 

128641 ttgcgaagga tttctttatc ccagttgatg ttgccaattc aatagtaaga gccactgaac 

128701 tcttagcctc cgtggcaaat acgattccca agcagcaaat gctcaaatga acgtggagtg 

128761 gacatctcgc ttagagaccg ccaagccgcc ggagaccatt ccaggacaat cgctatcagc 

128821 aatttccccg tttctctcct gcggtcaaat gagggtcgca gaatgttcga ccctctccgt 

128881 ttccaggagt tctagattag cagatcatga cggacagcgg tgtgtcatcc ggagggtacg 

128941 aatgctgttg gcagtcctga tccactttcc gggaaaagtt gcgcctcgca ccagcgagtc 



129001 agcgcatcga ctttagcctt tcggacgtta gcacagttgt ggggacgcct gaatacagtt 

129061 agatagactc cgtttgtcaa gccaaatggg cgatcactta aggtttagtg gtttagtagc 

129121 caaggggggg gtctcgtatt cccggttacc gaactgaaga gcagatggcg actccgtgcc 

129181 acgagccatg gttgtctcgg cgtgctcgac gaatccctaa tgtgtattag accaggtact 

129241 agcaggaatt tcatcagggc tgccatgatg gcggcgactt aacataaccg tattaactcc 

129301 ccactcatgt ccacttcccc ccatgttgcc tcaaatttat gtctgacacc catgtccttg 

129361 tctgcaccac aaggctgata tcgaccctgc tttcaatcgt gacgccatgg ccatctacgg 

129421 tggtgcagcg ccttcactgg cgggcagtgt cattatgttg acgacgctgg cattgttgac 

129481 ttatggcctg cggatttatt gcagggtgac caggaagtca tggagtgtgg aggattggat 

129541 catgacggct gcattggtac gtccagaggc tatcgaagct ttactggtgg ctaactgtcg 

129601 taggtgccat tcgctgtgct ggtagctgga tgtctggggg gcgcattcaa tggaattggc 

129661 attcgtgatt ctaggttggc agagccacaa aatgtcaaat accaagcaga aggacagaag 

129721 gtaggaacag tgcgagtatg ttcagctaca taatactgac tcttcagttt ttcctcatct 

129781 tcgaagtcgg ttactgctcc gccattattc ccatcaagct cagtatcagc tggatgttga 

129841 tccgagtagc cgagggtcgc aaagcgtacc tctatgcgca atacgtggtg attgtcgtct 

129901 ttgtgcttat gaacatcatc gcattgatct ttattctgat caactgcatt cctgttgagt 

129961 acgatacccg acctacctga gggttacagc cgatactgac tttacagcgc ggcctggaat 

130021 acggaactgc tcaagcaagg aggacactgc caaccatcct acgtattggc cgatgtctac 

130081 tacgcctgca cagccgtgaa tattctcacc gactgggtga cagctttgat gtgcgactcc 

130141 ttccagctat tatcgaatac cactgactga agtaggcctg tacctctgct ctggaacgtc 

130201 caactcaacc gcaacaccaa gatttcgatt gtcggcctga tgggcctggg tatcttgtaa 

130261 gacccgtttt tcctatacct tgccccgatg gcatgaacta actcatggcc tcctccacag 

130321 cgcctccatg tccgcctgcg tccgtcttaa atacaccgtc gccctaacca gccaatccaa 

130381 ctacctctac agcgtcacca acgtggtgat ctggggattc accgaaaacg ccctgggaat 

130441 gatcgtcggc aacgtcgcga cactgcgccc gctgttccgc atcctccgcg accgcaagac 

130501 atcgtccaac aacaaataca actctcgggg gtactacagc tctcagcgca ccggcccggc 

130561 aaacatgtat tctcgcaact acgagctggc tgagcagggg aagcacacta accagatcac 

130621 cactacctcg atggccgatc atacgcgacg acctagccag atgagcgacg gggacagtca 

130681 gaagcagatt ctagcgggcg gaactcctcc gggagacacg gatattcttg ttagtcggca 

130741 ggttattgtg gcatatgatt agcattggtt ggtcggttgg tcggttggtc ggagttgggg 

130801 gcatcttgtg tactacattc ctttagaaat ttccttaatc ttattaaact ataaatgaaa 

130861 tatagaaaaa agaattgaat gacagcttac agcttcagtt ctccaagcaa gcataagtgg 

130921 gataggacca cagagtaaga attttaattt attgctgggc taagcaacgc caagccttaa 

130981 cggcttacag tgggggactc aatttatgcg cgagtcagac aatcgtttat tatcactcgg 

131041 gaaggtgaca gatcatacga tcaagcttgc acaacgtggt caatttgggt tgttccgtgc 

131101 ttgttcttta gtcttcttgt gcttgttctg gaagttgatt agcctgtatc gtattagtag 

131161 tggtactagt aagtttatgg tgtggcacca ccacgaactg ctatattttg cctactatcg 

131221 gtgtctccaa cttcaacaat aattaccccg gagggaggtt caatgccaaa ttgatcctaa 

131281 actttccact tgatcgtgtt ttatatatat catatattat tgtgtcgggt ttgaaatata 

131341 ttcttcttca atcctagtca cgactttagg ggatcaagtc agcaagtagc ctgatcacat 

131401 cagtggcaga taataagcag cctcatctgc aagcaccagt gattggtttt atcatattat 

131461 catcatctga cccatttaac tcaatgattt gagtatagta cactcagaca tagtcccacc 

131521 tctcccatta gtgtttcaac cgatgacatg atctatccaa ccactccata atatacaatc 

131581 tccaaccaaa gtaactagta tacagacaac ccaacaacag aacaatgaaa ccaaacacct 

131641 taattctaca acttaacctc caactccaac tccctcacca ccccacaacc cacattataa 

131701 acagccccat gaacctccac ctcccccttc tccaccgcct ccctgaccac ccccatcgcc 

131761 ataagcaccc tcaccccctc caacacattc aactccacca atctcacccc cctctcatac 

131821 tccccatgta tcccctccaa ctcactcgca tacctatccc tcaccctttt caaagaactc 

131881 aaccacgcat ccagcacact tttctcccca tcctccccat ccctctcccc cttcggggct 

131941 gccaacacag ccccaacccc cccgcaacac gtatgtccac acaccaccac atgcttcacc 

132001 ttcaaatgca gcaccgcata ctccaccacc gcggcgcaat tcacatcagc ggcattcacc 

132061 acattggcga tgttgcgatg cacgaagacg tcacctggct tcaggtccag gagggttgtt 

132121 tcgggacacc gcgagtcgga acatcctatc cagaggattt gggggtgttg ggcgcgcgcg 

132181 agggtaggga atagggaggg gttttggagg gtggtttttt gggcccagag gtggttcagc 

132241 gtgagggctt tcgtgtaggg gtcttggggt ttttcgggtt gttgggagtg gaggtctggt 

132301 tgggtggttg gatggttagt tggtaagaag agggggaatg gagggaggga gggaggatgt 

132361 acctgtgacg gccatggttg gtgggtgtat acgtagtagt atgtagatgt caagtttggt 

132421 ttcgattgtg gattggtgtc aaggatgagg caaagtatgt agaagacaga tgatgaggaa 

132481 cgaactaaat acttatctac ctcccataca gaatgctaca tatcttctct ccattcgaaa 

132541 ggaaaggaga tactcctatc gagtaaccat gccgtatacc cgtcccctgt ccatcattca 

132601 tcgctagata gatgctccat tgcatgatct ggatcgatct atacacatga ccctacagcg 

132661 cataatttcc gcgtccggaa catcctattc gtccatattc atcaaagcac agactttgtt 



132721 gtacgatcag tgaggcatat acacatcatt ctatgtagat aggtataaag gggatgcaag 

132781 atgagggatt aatataccga tattctaggg atatacttcc gaggtaatct ggatcaatca 

132841 tcggctattg cggaaatata aatagtagtg gtgatgctct tatcgcggtt gatagggagg 

132901 tgaaacgcta gtgttcgggt gcgtcggcac tgttggtagg tttgtcttgt ttcaggtgat 

132961 tccggacatg tggcatttgg atacggagtt tattatttat gtataaatat gcagaacagg 

133021 tatctcctct gttgtggtag gatctaattg tgtatcactg gtatataggt atctatctat 

133081 cttacatctt ggccttctcg ctctgggcag ccaagtcggg catcagcatg tctctcatgt 

133141 cagatgccgc ctcgttaagg taatagctca tgcgctccgc tccaaggcgc ttgcagacaa 

133201 tgttgaagtt caagctaaat cacagtcagc aatgatcaac catcacgata gggggggaat 

133261 ttggtaaagg gctcacctgt tctcgttgat catataagca atgccgaatc catcgtcaat 

133321 cacctgactc catccgtagc cgttgaagaa ctccgagctc agctggctgg tcgacaggta 

133381 ccacgagcta ctgtaggcga atgccgggtc ttcgtacaga gcaggcagct tctctccctc 

133441 acgcagcagc ttcttcagac cgaacaagtg acggtccaca ccacggccat cgctggcttc 

133501 cgccgtgtac ttcgtgtggg cggccagcgc cgagcggagt agacggacag cctcctcgcg 

133561 aggcacggca gggtcactga tggccttgca gaaggcaacg ctctcgtccg ataccgtgcg 

133621 gatggtctct gtacggccct cctggaactt gcgggtcgag gcagactcgt aggtcgggcg 

133681 gttcttgccg tacatcttga agtaggccag ctggatgatc atctgcacgt acgcatccgg 

133741 actgcacttg aacttcttga tgagaccctt gccgtagccc tggtaggcct ggacccggag 

133801 ctcgtgcgac gagatctgct ggcggtgctc cttgttggcg gcgtcgatgg cctccagcac 

133861 aggtccgtcc aggtggaaag tgatgggacg agggtcgggc aggtcagacc gcacgggctt 

133921 ctcggagagg tcgatcttgt ggttgaagat gaggttgttc aggtggtcat tcatgcggtg 

133981 ggtggggctg ccgtccatca tgctgtgctc acccatgaac ccggcggtac cattgtcgtt 

134041 gatgatgaac tgcaggggct tgtcgaacca gcggttggtg ccatcaccgt gccagtactg 

134101 gtgagcacgc tcctccaggg tgacaggggt ggcattgtcc agacagacga ggaatccgct 

134161 ggattcgatg tcccgcaatg cctgctcatt ggccgggtgc gcggccagca gcttcttccg 

134221 tgcgtcggcc cacaggtcgc ggttagcgac ggtcaagacg ccgaccgccg gagccggctg 

134281 agcgacctcg tagatgcgct caaactggcg ctgcagctcg gagttgttca ggcgcttgcc

134341 attgacggtc aaaggcacct tgaagaattg gttcttgcgc acgaccacaa tgtgttcgtt 

134401 gtcctgagcc gggaacttct gggggtagtc gacactgtca cccgggatac ggcagcagtt 

134461 aaacatgtac tggtaggtgc tcatagccat gggctggccg cgcatgtatt caggctccaa 

134521 ggagccatcg tcgacctggc gcttgaactc cagggcggca gtggcaaccg aggcggcgcg

134581 cttggcgggg tcccgacggg cgcgatcgtc gcggtaggag tagaagtagg aaacgtaggg

134641 gatgacaggg tcccggtagc ccaggtacgc ggcgtggttc caccactcaa gaagccagtt 

134701 cttaatcttg gggtcggcgg cgcgggcgag taacctttcc tgaaggggct caccctcacc 

134761 acccgggcgg acaaaggcct ccacggcctt cttggtgcgc tcatattcgg cctcggagac 

134821 cagcgcgtgg acggacttga ggtagcgacg gcccgtttct tccagagagg ggaccggcag 

134881 acgggggagg gagtcttcga atctgagcat gttgcctttg gagaggtctt ctttgtagcc 

134941 tattcattca caacgtgtca atggcggatc tcgcctcgga ttcgcatgtg atcgcttacc 

135001 ctcagggacc gaagatgctt tgcgccgggg tgccattatg ggtgtctaat cggtagaagt 

135061 cacaattagc tgccatttcc cacagacttt ggtggtatgg aggtagactt accacagcgg 

135121 atctggcgag gagtgaattg gcgggggaga gacgggtccg ggggagagtg gagagacggt 

135181 ttctgctcga agccgcgaac atgcctcgcg gttgtgaagg gacgggggga agatgagaag 

135241 gagaaagaag agttagagat ggttaagtag agagaagaga gagaccagag gaacgtgggc 

135301 taaaaagtag caggaaaaag gccacatgag ataagagagc gagcgagtca aaagacaagg 

135361 aacggagcga gagaggaaga tacctcggct ggatggaagc caggatcagc cgactgttgg 

135421 atggtcacgt attgtcatgg ctggggtgga tctttcacgt tggtccgccg actggtgggg 

135481 atgccgtgta tgtaacggta atctgagtaa tgctgggcaa ccggatcgcc gaggaactca 

135541 ctttgggccg gtggagacgg aattgaccca atttctagcc atcacgagat atcatacccc 

135601 ttaccagttg atactttggg acggacataa accccatggg gtacggaaca gcttgaacat 

135661 agtagtccag cgctgcaatg ataatccagc tacgtcgaca ttctataaac taaactgtct 

135721 ctccattcag ccacggatct cgcagcagct gccctgccgt ctttctatct tcgggtcgcc 

135781 actgaagcat ccctctcatg aaattcaaaa atgcctcctt gttgcttcct tcaaggtatt 

135841 cctctgacga ttggagtcct tcggtatgcg ggatagcgat gtcttgcttc cactgtccta 

135901 cccgttttgt tcagtctgat gccaatcgct tctaagatac taagataaca taccgtcttc 

135961 agtaaagaac tcgtggcttc gcttcccccg ttgcaataga tccaacggag gatgccccaa 

136021 aagaccaatg acctcagcga gatgcgcacg tgtcgagtat ccttttccat caggatcctc 

136081 cccgtagaac atatgcttgc cttcaaacag atcccagatc tagttgcccc atcagaaacc 

136141 tttctcagat ttccgagaat acgataggac ataccataac acccacgttc cagatatcca 

136201 ccagataact ccaccccacc tgcaacatga cctccggtga cctgtaaact gcaggctgcg 

136261 catcatgatc cctgctctca tctccgcgta ccgcagaacc aaagtcgctg aggaccgctc 

136321 gaccaaatga tatgggtagc ccgaaccgtc gcgaagcata caccggcacc ccgtcagcga 

136381 gtttgcgtga agaaggcctt tccatctcac cggcagtaaa tgcttccatg agtgaattat 



136441 ccctaatctc ttggagaata ttaccgccct ttatatctgc aacatagacg catcagtacc 

136501 tgttctcctt tcccaccaca aggacaatcg caccagtatg gaccaccctg cattcagtat 

136561 gtaagtagtc tagagcaaga aatacctgca tcaaaccagc tttgagcaga tcctccgtaa 

136621 accgatgacg gggattgcga aacagcaaat cggtaaagct atcccacatt ggttgctgga 

136681 ctagacaggg atggtcgcct ccaggacggg gaatgatgaa agtatcaagc gccgttcgta 

136741 catagccgta tcctggatga gatgggttgg cttgacttat gtgctgatat gttttgaact 

136801 cttcagtatg ggattcatcg cgggtataaa ttttgagggt cacgtagcga tgagatctga 

136861 tcatctagca tcagtccgca tgttggttat atgaagcagt gggagaacag actggagatc 

136921 ctgagctagc cagacagtcg atattgatcc gtatcctagt ttcccaatga cctgatactt 

136981 ggaggcgaag acatccccaa tgttaacagg gtagtaatgg ccattcgtga aggagttgaa 

137041 gtgctcttcc tcgaggattg tggatgccgg aataattgtg aacccggtag ttgggaagcg 

137101 ccgaggaggt aaaggagctc tgcgaggatg gatagatgct cggatccact tgagcagaga 

137161 agccataaaa gcacatgcaa aatgggtgat acagctacct gaggttattc agcatcgcca 

137221 acagggtgaa taaaggatgc attgtttact gtatcaggat tactagcggt aagagacaac 

137281 cttaaacgga acggtacaga ctagtaaagg tcgcattgcc ccagactcgc tgccagcctc 

137341 ggtccgtcga ggcacagcgg caacaaccta ccgggcagtt ggaagccagt tctatggacg 

137401 ttctttggca tagatctagt gggaagctta actagaactt tagtcgtatg tcctcttcgg 

137461 agatacacta tgtactacgc tgcagctcag ccggagcagt cgatggtcca gccgcagtga 

137521 gagcggaatg cgggggagct gctggactgg ctcatatggg ggaccgtagg ggatgaggga 

137581 gtccctcggt cgggggcaac tcggtcagca gagcagacgg gactgacctt ctcatcggca 

137641 tctccacttg agacagcggc gtcatcacta acaggattgt ttggcacgcc gctgcgtcac 

137701 tggcacatat tagtctggac agcgcatctt gcttatagct gaatgcacca cgtgaagtgc 

137761 attgcaattc ctttcatccg tcccgcgtct ccgaatctgt caggcggtca tccaggatat 

137821 tgagcatcga atggcgagga gagtctgatt ggaagcttgg accattcaca tgtttgtgcc 

137881 agccattgga atattttgtc atgggtatca atacataagc catctatcta tctatcacta 

137941 tccagcagtt cgccacatat gctcctattg gattcgaggg aaatatctac gcaatcagca 

138001 tccgcgacgg cgcagctgct tatgactgct cccgcgaaac ctccgcgaca aaatccatca 

138061 gaccaggctt agaatctgtc tgagtggctg tccacgtctg agactgcaaa cttggtgcct 

138121 tggggccgcc cgatcggcca aacccttcaa gtccccgaag tgaaaatgtt tccactttca 

138181 attcctcccg tgacaggctg attttcagct tcttgctttg gccagcagtc gaaaacgcca 

138241 gcagttcgcc gtggtgcgga tatgccgcct cccacgtcaa agtcccctcg gcgccttctt 

138301 tcttggtggt atttttggtc gtcttcgagc cctggagcgc gatggccgac acgatgtcca

138361 acttgacgaa atgcatgaga gctgcagctg cttcattctg cagtccaacc cgaccgggag

138421 gctgaacttc ggggtggtga ggtaggtcaa tggaaatgtc gtagtgggtg ccaaatggcg 

138481 gcgacaccgg gttggtgcgg attttcacgc gaaaagagct ttgaggggtg atcagatggc

138541 cggaaaatgt cgattcgaag ggcatcagga actgttgtac cagactctct accttcggaa 

138601 cagcctgcac cttcggaagg ctcacagacg caaaaggcac ggcgtggaaa tcacgctcga 

138661 tccctgccgc ccgcagaacc cggtatatat cattcatgaa agactccaac cagcggacgt 

138721 gcgagctgtg ttggagatac gccatgacag ggcgcaggag ctgatactcg ggggtaagcc 

138781 gacgtttctg ggagagcggg ggcggcggtt gggttctgcg tcgcaggttc tggcggtgtg 

138841 catacgcgag caggatccga attgagtgcg agagcccatt agcaaggaca tcgtgttctg 

138901 ttgactcgat cttgtcttcg tcgggcgttt cacggtccgc atctaccaga tccagcagga 

138961 tctcttggtc ttcggatatc ggcagtcgga ccaggttctg acgagtcgtc acaccatgac 

139021 ttcccatgat acgcgcttcg cgcatcaact cgtggaatag ctcttcctcg tacactgtgt 

139081 cgcgtgcttg caaaatcctg cgctcgaccg acgctgcatc ggggatcgga tccggcattt 

139141 tagagcatcc cgtaagttgt ccgcggtgct tcacacgcac tcgaaccgtc ctcgttttct 

139201 gtggtgccag tcccttatcc agtattaagc ttccattatc agccctccgt agagcagcaa 

139261 gaccccggtc gcggaatata ggcgtagcta taggaacaca gtcagcatca atccagcaaa 

139321 gcacgtagcc atcctacctt ccaagaaccc atattgtact cccagcgcag ttctctcccg 

139381 gggcaacctg cagaccttcc atcccttctc cttaaccgcc aatacttcat cccagtactt 

139441 cgtctccgaa gcgacctctt cttccaatcg cgtcgccgag ttgagcaatt tattcgcagc 

139501 ggcattgaag ttctgcgccc tccaaccccg cgacacagtc ttgacatcct gcacagcggc 

139561 ctccgacttc ggcggcggct cgacgatctc cgcattcaac gacccaagcg gcgcaacctg 

139621 tttcaggaac gccgacatcg aagtctccgc ctggcgaggg gtatgcttcg aaagcaaaag 

139681 ggaaacaaaa tccaaagcga acaaagcttc catatgcgcc tgccttcacc ccgcaacaaa 

139741 tcaattagcg caattgccca ccattgctac cagcagacca cctacttaca tagcgaactg 

139801 caagatctca gcacgccgct tgtacaatgt ttccagacgc tcggttgagt ccacatcctc 

139861 agcaagtttt tcgtcctcat cttcctcttc ccatgggtcc cggttcttgt cctcttcgat 

139921 cttggcgagc agtgtctctt cgttcacgtc gcggaaagat ccccatcggg cgttgatttg 

139981 ggcgatctcg attggaaggg tatcggggcg gtcgggcttc tcggtgagcg ggcgcagggg 

140041 gaggctgaat gagtcggaca tgattgcggt ccgctctgcc gggggcagtt tgagttgaag 

140101 gcgtattgtt gtttgattaa tggagagatg gtgacggatt gattgcactg ctgataggag 



140161 gcacgtgact ggtctggagc tggggctagt ccggatgtgg actggtgtgg atagtgggta 

140221 caaagttaat gcctcaggaa gcaacagtta tggttatagt attctaggat atactacata 

140281 tatacatctc tctttgctat caagcgccat caccaagctt gttcccctaa aaattgcagt 

140341 agagtgagca agctctgtta gtctgcattg ttgaagcaag gccccggcca cttcttagga

140401 agcataaccg gcgacagcga catgggcaat agcccactcg atgaaaataa agtgtcacca 

140461 ggatgcttca gtatagcacc caatattgct gaaaacaaag ccttccgcat ttagattccc 

140521 acattatata ccgtttacag atcatgatgc tcgagaagtg accaaaacaa attatcactc 

140581 cagccaatga tatgtgacca accataccgt gacgccggtt ctacgacagg ggataacagc 

140641 gtccaaacat ccttataggt ggaacacata tgcgacctga gcacatcaca gccttgctca 

140701 tgacaactca gttcacnccg cttctgtcca gccttgcgat acttccaatt gacctggatt 

140761 ttaagaaccc cagatactcg tccccttcaa cctgaacctt tctctctcca aaccctcctc 

140821 atcccacaaa ccacgaataa caacaagaac caaaacaaca agaagcaccc ctcacaacta 

140881 tgccctcccg aaaatacaaa tccaaatcca aatccaccct ccccggcaac cgctggtcca 

140941 tgtacccagc ccttcacgcc gatgtcctca cccagctctc atcctcgacc ttcccaatca 

141001 cctctctaac cttccacccc ttcgacgact ccgacacttg cctagaagaa tatgacacga 

141061 acatcatggg tcgattcatc tgcccgaaca actcctgcgg gtcaagcggc tggacgagca 

141121 agaagatcgc catcacgatc cggctgtacg aaggggatga gtacaacgcg cgggtgtatc 

141181 atcagcgctg caagggatgt aatgcgctta gtcggccatt cctggatgag aagagctatg 

141241 cggagaggat tgcgtatcgg atgaagaggt ggtatgatgt ggatgtggag aggccggtgt 

141301 ataatgggag tgggaattcc aagccgcata atagggattt gtgtgagggg tgtaaggtgg 

141361 ggaggtgttc gtttgcacgg gagcgtgatg tcttttttgt aaggggtggg ttcttgtgaa 

141421 tgagatgttg gtcttaatat tggtattgat tggtcagtct gttgttatct cacattaagt 

141481 agcagcatgg gatattgagc gatatctgta atgacagaca tgaatcaggt ttgcttgggt 

141541 acaaagtgtg caggattaac caaggtgtga attgaagata taatcaagga cataaaatta 

141601 gaaaggtatc gactaagcac ctccgagcaa atcaaaaaca gataacaaaa ggtgccgttc 

141661 aatcttgggc gggtgcttgt taagcttcat tgccgagaac acagccatct gggcaccgcg 

141721 cacgctgtac tccatgctca gagtcgtatc atcctgaatc tccacatact gtcccaccaa 

141781 ggccagattc ctgctctggg gaggaatgac ttcgggtcga tagtcatgac gacggggcat 

141841 cagtggtgcc gtggccatgg gcatcagcac agggttcgtg atcgaattag cgagcatgtc 

141901 atccactggc attcctagct gggatagcac ctcggtcaag atctccttgc cggagcagtg 

141961 gcacatgggc ttgtgaacga agttgccggt cttctcggga tgcaacccat agccccagaa 

142021 tacatcgacg gaggagggtt gatcagagca tacgggctga cggggaacgc tgatactgag 

142081 gccccagttg ctgtcggaga aggaaagcaa agcccccgta ccaggtttgt tgttggtcag 

142141 cttttcgtag cgctccatga agtcgctgtc gcggagggtg actgtgaagg actccagggt 

142201 cgactggtcg acattgttgt ggaagttgaa cgggtttcca tagtccgtcg aggcctgcat 

142261 caggtcgatc cacagagccc agtcatcatc caagaattct ttgctgtgct ggggtggagc 

142321 agccggagcc ttgtcgttgg agcccctctc ggttgcagag gtcgttgatc caagggtcac 

142381 catgacaagg tcttcgggcc tcacttcaat ggtcttttgt tcgccgttat ctttgataat 

142441 aatcccggac acggtcgtct gggcctcctt tgggttgatc tgcagatttg taacctttgc 

142501 attgaagtgg aagtcgactc cctgcgattt cagatatgat tcgattggca tgataaccga 

142561 ctcgtaaatt gtgtacttgg ttcgatccag agccttcacg tcgttcaggc gttcaatttc 

142621 ggccagatgt ttacagagac atcttctgaa ctccaccagg ctgtgccatg gttgcagagc 

142681 gaagctgtat accatcagca ttgtaacagc gggcaaacat agatatgaac ctacgtagtc 

142741 gcccacaacg tccaaaagtt agtcttgaag aatgcctcat cgaaagcatc ctggatggct 

142801 ttttctccga gaatacgttc attttcgagc atcaacttga tcagttccat tcgatacttc 

142861 ggtccaattt ggagatgttt ggagtagagg cattctgggc ctttatcccc atgtttaatg 

142921 agacgggttg gcgccgactt tttgggagaa gttgcccggt caagctcttt gatgctgtcc 

142981 agaagtgact gtttgggatc atcgacgctg gggacaaggg aaagaaggtg ctctacacag 

143041 cgatcatgga agtaaggcaa actgccagtg tacaagacgt agccgctctt tgcgtcgccg 

143101 cagcttttga tgccaccgcc cgcgtgagag tgaacatcca ggatgtggat atttgctgca 

143161 gggacacccg cgtcatgaat gaggtggact gcgccggcaa gagaagaaat cccactgcct 

143221 atgagccagg cctgggtggt ttttggatcc cgcttcggca atccgtttcc gcggttcatc 

143281 ttgattatat tgattgcctc ttagccgtag tcgtcgttgt ccgatgatat atattggata 

143341 tttgaagtac aagtaaatta tgtgcaggag aatgacaggt acatagcatg gactacgtac 

143401 tgagccgata tgagtacatg taggaaatta ttgcaatact tgggagtgtt cggaagccac 

143461 agtgtcatga tatcatcacg atggaccgac tgtcatgaga cgggaagcag tatcacgagt 

143521 aatcggagca taaacagcgt agtcagtgcg ctcaaaagat ggcctctatc attattggcc 

143581 ggaaggcgtc ttccctctcg tgaacctagg agagctatct agtaggtccc tgtccacgtg 

143641 tgggaaggac ccgcggatgg ggaaacctta agttgcgcag ttcgtgatct gggcagatcc 

143701 atccatggcg ggatcagcca cgtctaatgg aacgtcaggt ttgcggcata aggaagccgt 

143761 caaattgtat gatttgtgcc aagtagctga tacgttgcac tactgcagaa gctttgagct 

143821 tcctagtggc cagcttggag acgtttccat gtcacctctg gaggtcttct cacctaccag 



143881 ctaaacggtg atcatccgtt ggcacttgat aggatgcgca gataaattgg ttcccatcct 

143941 ctgccgtgcg gccggggcca caaaatccct tgtcggcttg atgatagttg catggtgaag 

144001 cattcgaggg catgtaacca atatttcagc catgcctttc ctgtcatttg cccgtgcgct 

144061 ggaactgagg aggcagctgg aaggaaccag agcggaggtt gtctgtatcg gctccgatga

144121 ttatgcgacg agtattcgcc gatggagtga cacgtgcgaa aaagaagctg tatgtgaact 

144181 gcgaccactc actcttgacc ttgataatta ctattctgac caaacatagg gggctgttgt 

144241 acgtgtaacc agcacgtcag aggtggcgga ggtagtacgc ttctgccgga agaatcacat 

144301 cgacttcgtt gtcgaggctg gtggacactc caccaccgga gcttcctcgt cgcatggagg 

144361 cgtagtaatc agcatggcta gaatgtgcaa ggtcttgaca gaccctgcct ctgagacggt 

144421 gtgtgttcag ggaggcgcta actgggatat ggtcaatcat tcaactgctc cttacgggct 

144481 agctgtggta ggcgctaccg ccagtcactc tggggtaggc gggtcagcat taggaggtgg 

144541 tttcggatgg ctaaccgggc agcacggcct gattgcagac cagttgctga gtgtcaagat 

144601 ggtcctagcc gacggcagca ttgttgaagc ctcggacgag gataaccaag atcttttctg 

144661 ggcggtgcgc ggagctggac aagcctttgg tgttgctacc gagtttgtat tccgtgctca 

144721 caaagtccga gacaggttct tcggagggtt ggtatactac gatgtggata agctgccaat 

144781 gctggtctcg tttgcaaatg agttcgataa acgtcaggat ccgaagagtg gtttcttttt 

144841 tggattcgca gctccacgag agatagggca catggtagtg ctggccgtac tcttctacga 

144901 cgggtctgcg tacgacggcg aagcattctt tgagcctatc ctaaatccga acccgctgat 

144961 caaccgagct gccatgaagt cgtacataga gatgaattca attgctaatg tcgacccggt 

145021 gccagaaggc cgcaaaagca tcggtggtgc caacatcatg ccacctctcg agacaagcct 

145081 tttgcagaat ctctacagtc agtttaaaga ggctatgaat acgtaccctc gtatggaaga 

145141 cagcgcattg gtcttcgagc tgttgcctta tacgaaggcc gtgcaggttc ctattaagga 

145201 gaccgcgtgt gccaatcgag gaccttatta caacgttggt ctcatattat gttggcatga 

145261 ctctgatcta gatgcgaaga tgcatgctct tcaacggagc attatctcca aaatcttgga 

145321 agcgcaacga gacattactg atgatcatgc tgtggtgtat cccaatcttg ccggtcagta 

145381 tctttgataa tggtgcatat caggactccc ttgctaatgt gcgtaggaca tgacgttagt 

145441 gccgagaagc tatttggtgc taacctgcct cggctacaga agctgaagaa gaagtatgat 

145501 ccgcataatg tattccgcaa gtggcatgat ctacttgcac cagcaagatc ccatgtagag 

145561 cagacagaca aaccctagtg tcagccccac atctggtttg tgtgtgtgca atttccgtta 

145621 gtctgatgtg aaagtgaatg acaaaattgc tttatctgtc ataaaaagcg ggtattgtca 

145681 gtagtcgagt gcgaaatacg tgattgtccg cttgtgtgcc atgtcaggca acccaggata

145741 taggggatcg ccagtgactg aaagagcagc cacaccggtt tgtaacagtc tgcatcttgc 

145801 ctccagcctc atccagaaat tcaatctatc agtagccgct ctgaagatgc ctgttcatat 

145861 ctccgcggga ttgatgatac ttggttctgg ctcgttctaa ggtcaatcga gtaactaaca 

145921 gactgctaca cccgccattt actccctagc ttgacatctt ggtaatgctt ttgcagcgtg 

145981 caagtataca cctcaggaga acacaatcat tgccatcggc gatttgttga aacttgcaag 

146041 gatcgaatcg aggccctgct ccggccgaac caacacccac ccttccgaga tgtagccgcc 

146101 gctggccgga tattacacgg ctgctccttc caagcgatag gtcaggctat gttatactcc 

146161 tggatcatac gatgccagta tactgatggg tgtcattgct gcgtcagggc gagaggaggc 

146221 gtttttgttt ctctgggacg ccgaaagctt ccccgctcca ggcccggcgt tcggcaacaa 

146281 cagcccttgt agggttctac gggaagcaac cccgaaggca atagcccgta tactagaggt 

146341 tatatgagtg ctataatgtg ttgatttgac atctcaagta taatcagtcg tggtaaagtt 

146401 aactgtaata gggtggtaat tatcatacga aaatgagttc cgctaggtcg cctttcgacg 

146461 cagagtcctg gtctgggaaa ccactaaatg tgtgatgata tgttcatctt aaaacagacc 

146521 gtcaggatac taacttgacg tctaggttat atacgctgga attacagagc tcgtctgcaa 

146581 caattcgagc ggaagagttg caattgctat ccgcaatttg acagatctgg tcgacttcct 

146641 tgtgtgtact tggcacgctc tgcgtccaaa cgtgtcggac tatgcaaccg acactataat 

146701 tgcagagctg aaagactaca gggagaagaa tactgagaag ataattagcg ctgctttgca 

146761 tcaatccctt gtctatagat gtccgtcact atgctcgcgc ctatggagcg aactcgatat 

146821 agtcccttta gttttggagc ataaggatcg tgaaagacga catgatgacc agggagagct 

146881 agcgacgttt gccgggtgga ataagaagga gctcgacgag cgggcggatt caatggtgcg

146941 gaagtgcatt cggttggtgt tcccttgatg gtgagggcta aatactgact tgtgtagttc 

147001 ttttggcatt ggccatgtgc tacataatca tataaatttc gacgggtcgg tggatgtcga 

147061 cagaggctat catgtacacc ttgctaatgc gaaagactat gagaagactg ttgaccctgc 

147121 tacatggtct atggctcaat gttttgccca ggatctgagg gaaagggagg tgaaggtggc 

147181 tttcttcagt atgacttgcc aaggaaaacc agatgttccg accagacatg ccttgtcacg 

147241 gttcaccaag agtctgggag ttcgtatcaa atggtatgac tctcatccta tggggtcaat 

147301 ctaaacgata ttcttacatt gtgcccacag gtttgtgcca aagccacgtc cggggatgat 

147361 tcccctcatc cggaagatgc aggacaccct agagggacag ggcgatcccc tgtcggatat 

147421 caccatcaac gatgagctac tcatccttga ctttgcatac gccaacgcca ggaggtattg 

147481 gctgtgcgag aatggaccgc ttaggccccg tgcggaggga ggcgtggacg tagtgatcat 

147541 tgacagcgcg ccgcttctaa cattggcaat tctttctaag cagcaagatc ctgagcggcc 



147601 tgtcatattt gagagcagcc tccagcccca gggagtctca ctcagtggga ctagtagccc 

147661 gcagtctcga gcatgggact ttatacgaac ccggctaacc cacgttgatt tggtggtatc 

147721 gctgctccct aaggaacttg cgccgcgcat aatgcctgag gagaatgttg gctacatgtc 

147781 attctctgtg gaccagtact gtccctagcc ttctatgatt tgcacggatg ctgatacagt 

147841 gtagactcga tggtcaaaac aaacctctaa ctgactggga cgtgggcttc tacggtcgtg 

147901 agttcagctc gctatgccga acgcttcaaa tgtccatcat acgctatcca gagggtaata 

147961 ccccagacct gcccccttgg ccaatactaa attgtctgca cgtagagcaa tatatcctcc 

148021 acttatccca attccgaccc ggagacggaa cactttgtct tctccactca taccagaaat 

148081 tctgcgatgc ctataccaag gagcatccag gccgccaagt ccccaagctc ctgatctgtc 

148141 accgcggacc cttccgaacc ccagaaagta ccgttttcta cgacgcagca atgtcccaaa 

148201 ttgacagtag tgagacccta tctacgtccg tctgcatcat tccaatcgga gccgtcgatc 

148261 aaatgtggaa cgccctactc acgaatgcaa gagcgctcgt ccagctctcc acgctgcacg 

148321 gggtccctga aatgcttctc gcggctattc aaaaggggac gccggtgatt gctgttcggg 

148381 aagccgagct gttccccttt gtgcatgaat cagaaaatgc tatcctggtt gacaagggtg 

148441 atgaggaggg gattgctcgg tgtttttcga ggattttcag tgttgatgag gttcgtcagg 

148501 gggaaggaga tgcggggttc aggaagctgt ctgactcgaa tactacggtt ggaaatgcgg 

148561 tctgttggtt gtatttggcg tcgaagatgt cgagggggga taagtttgag ccgagggggg 

148621 cagatattta taagttggcg agagaggagg ccgggtataa tgagtgtgat tggcatgcag 

148681 tgtagttagt cagtcacaat tgggaacata atgagattga aaaatgcgat agctgttttt 

148741 gagtgtatga agttcctaca gagctactgt tgaatactta atatatattt cacaatgcat 

148801 tgccaggatg atcagtagta gtatgatagt actaacacaa tctcacgttt tcccaattgc 

148861 agttagtatg attgattgat cctgctatta ctagtactag cttctttcat tgtatataga 

148921 taaacattat aagcagattc acatcttaaa agaatgttga ttgcgcactc aataaattca 

148981 cttcctacag atatttttag agtaactact aagcgactcc tctgattggc caatgggtcg 

149041 tgggtgggcc ctgtggatcc cttacagctt cggatccact gtagaaaaaa caatttgtat 

149101 gcttccccat cacgaaccac accgcgagca gatgaaggct tcaaaagcaa aaagtaaagc 

149161 ccaacgcggg gctcgaaccc gcagccttga gattaagagt ctcacgctct accgattgag 

149221 ctagccgggc agctcttgag agatgcttgg ttaagagagt ataaaagggg ctgggaaatt 

149281 gaactcgttc atgcgatgtg agaggcaacg ttcacacggt agtaataggg acagggaggt 

149341 agaaaggggt gggaatatgg gtgtcccatt attagtgaaa gtgttagtag acttatctga 

149401 attgtatgtc gtcttctccg agcactgatt gcaggttgtc atggacgggt gtcagctagc 

149461 agggatgtgt ttgcctggta gtcaaggctt tgtgtctggt gtatgtggtg gtgccactga

149521 tcttgtgata gtgtagactt tcattatatt tggttgcacc gggtgctgtt tgggcagggg 

149581 ggtagagata aaccgcgcca cttgctgagt gtacagatcg tatcaccagt gttgcaccac 

149641 ggtctcggcg tgatagtgcc attgtatcga gtgtgtagcc ggcatccagt ccacgtagga 

149701 ttcaaatgca gaagtgatgg atgagcaacc gtatatgaac gcatacagta cagggagaac 

149761 atggatagca ggagttgact atgttgtcat atctccaccg ccaaaccggg catcgggggt 

149821 ggaccacagg ggtccggcag ttgacccttc agcataacat catgcatgaa taaagtccgg 

149881 ataataggca atcagccctt tgcagggatt ctcaaatgcg atgaaagaaa cagaaagtgt 

149941 atccatcata ccacttgtca cctagatatg gtgctcgtga gcaaactagc actgtagcac 

150001 tgtcgcagca acgttcaatc agcggcgagg gattaggggc gtgtctgcct aattgggact 

150061 agtgaagata gctatcacgt gcagcacggg ccctaaccca taaattacgt gagctgtccc 

150121 gcactagtcc taattgagtg caggcggtga catcaggtgc tttcaggact gcaagaggaa 

150181 atgggcggag gaagacattc cctgccaaca cccaagtcgt atctgcccat tgcagcgtgc 

150241 agcaaggtca cccggccgat cagcattcga cgagcatcta gccctttcac ctgcagacca 

150301 gttatctccc cagtggtggg ctgtttactc tgtacaaagt acttttccca gcaacgagag 

150361 agggtatttg tatagcgggg cagtgacggc cggagtcagt gacgtactgg ttgcccacac 

150421 ccgctttttc tcaccggcac ctgcgctgct cttctccccc acccttcaaa cacaagttat 

150481 ctctgccttt cctccgcaag ctgtgtgctc ctctcctctg ctcctccatt actccattct 

150541 ttcttctcca tctttttatt ctaaatcctg gttcagctgt tgctcatccc gaacaattgc 

150601 tgaacacccc ccaacaaaaa ccatcgcacg atggctcccc cccaagctcc cgtgcaaccg 

150661 cccaagagac gccgcatcgg tgtcttgacc tctggtggcg atgctcccgg tatgaacggt 

150721 gtcgtccggg ccgtcgtccg gatggctatc cactccgact gtgaggcttt cgccgtctac 

150781 gaaggttacg agggtctcgt caatggcggc gacatgatcc gtcagcttca ctgggaggat 

150841 gttcgcggct ggttgtcccg tggtggtacc ttgatcggtt ccgcccgctg catgaccttc 

150901 cgtgagcgcc ccggtcgtct gcgggctgcc aagaacatgg tcctccgtgg cattgacgcc 

150961 cttgtcgtct gtggtggtga tggcagtttg actggtgccg acgtttttcg ttccgagtgg 

151021 cccggtctgt tgaaggaatt ggtcgagacg ggcgagttga ccgaagagca ggtcaagcca 

151081 taccagattc tgaacatcgt cggtttggtg ggttcgatcg ataacgacat gtccggcacc 

151141 gacgccacca tcggttgcta ctcctccctc actcgcatct gtgacgccgt cgacgacgtc 

151201 ttcgatactg ccttttccca ccagcgtgga ttcgtcattg aggtcatggg tcgtcactgc 

151261 ggttggctgg ccttgatgtc tgctatcagt accggtgccg actggctgtt cgtgcccgag 



151321 atgccgccca aggacggatg ggaggatgac atgtgcgcta tcattaccaa ggtgggttga 

151381 tcggaacttg gtggagagaa ctcagaggca tcactaactc cccgcagaac agaaaggagc 

151441 gtggaaagcg taggacgatc gtcatcgtgg ccgagggtgc ccaggatcgc catctcaaca 

151501 agatctcgag ttcgaagatc aaggatattt tgacggagcg gttgaacctg gatacccgtg 

151561 tgactgtgtt gggtcacact cagagaggtg gagccgcctg tgcgtacgac cgctggctgt 

151621 ccacactgca gggtgtcgag gctgtccgcg cggtgctgga catgaagccc gaagccccgt 

151681 ccccggtcat caccatccgt gagaacaaga tcttgcgcat gccgttgatg gacgccgtgc 

151741 agcacaccaa gactgtcacc aagcacattc agaacaagga gttcgccgaa gccatggccc 

151801 tccgcgactc ggaattcaaa gagtaccact tttcctacat caacacttcc acgcccgacc 

151861 acccgaagct gctcctccca gagaacaagg tttgtcgcca cagtagctgc ttcggtgctg 

151921 agctaacaaa aggcagagaa tgcgcatcgg tattattcac gttggcgccc ccgctggtgg 

151981 tatgaaccag gctacccgcg cggccgttgc ctactgcctg actcgtggcc acacccccct 

152041 ggccattcac aacggtttcc ccggtctgtg ccggcactat gatgacaccc cgatctgctc 

152101 tgtgcgcgag gtggcatggc aggaatcgga cgcctgggtc aacgagggtg gttcggatat 

152161 cggtaccaac cgtggtctgc ccggcgatga cctcgcgacc acggcgaaga gcttcaagaa 

152221 gttcggattc gatgcgttgt tcgtcgtggg tggatttgag gcgttcaccg ccgtcagcca 

152281 gcttcgccag gcgcgcgaga agtaccccga attcaagatt cccatgaccg tgctgccggc 

152341 gaccatttcc aacaacgtgc cgggcacaga atactctctg ggtagcgaca cctgccttaa 

152401 caccttgatc gacttctgcg acgccatccg ccagtcggcc tcgtcctctc gtcgccgtgt 

152461 gttcgtcatc gagacgcagg gtggcaagtc gggttacatc gccacgacgg ctggtctgtc 

152521 ggtgggcgcg gtagccgtgt acattcccga ggagggcatc gacattaaga tgctggcccg 

152581 cgacattgac ttcctgcgtg acaactttgc gcgcgacaag ggagcgaacc gcgccggtaa 

152641 gatcatcctg cgtaacgagt gcgcgtccag cacgtacacg acacaggtgg tggccgacat 

152701 gatcaaggag gaagccaagg gacgtttcga gagtcgtgcg gcggtgccgg gacacttcca 

152761 gcagggtggc aagccgtcgc cgatggaccg tatccgggcg ttgcggatgg ccaccaagtg 

152821 tatgctgcac ctggagagct atgcgggcaa gtcggcggat gagattgcgg ccgatgagct 

152881 gtctgcgtcg gtcattggta tcaagggctc gcaggtgttg ttctcgccga tgggtggaga 

152941 gaccggcctg gaggcgaccg agacggactg ggcgcgccgt cgacccaaga cggagttctg 

153001 gctggagctg caggacacgg tgaacattct gtcgggacgg gcgagcgtga acaacgcgac 

153061 gtggagttgc tatgagaatg cttaagcaaa aagcagcgat aaacgcctag actagagtca 

153121 ggctatgatg atgaaaggtc atgagcaata ccgaattcaa ctacagtaca gagtcacatt 

153181 caaacgatgg gtgccagtaa aaagtaatct aaagctgttt atttaaatac atacccccta 

153241 gccttccaaa taaaataatc cctccccttc gccaaccaca ccatgaaact aaacaacacc 

153301 aacccaaacg cctgtcccgt tcccccagaa cttcccaccg gcaccaccat catcacctcc 

153361 gtcccttcat catacgtctt cctcccacca ctgcaccccc tccaccaccc tatccccccc 

153421 ttcccagcac tattatcaat atcattcacc aaattcccca ctaccacccc cgccatctca 

153481 tccgccttga acgacaactt ctgcttacaa tccgtaacat cccccgccgc atacacccca 

153541 tccaccccca ccactttcaa ctccgaatcc accctcaccc agcccttctc atccaacacc 

153601 tccccgggga tgtaagaact attaggcagg actccgatcg tagggatata aatatccgta 

153661 tccatcgtct ccccattctc gaacttaagc tcatacccac ctccaccttc cttagaagcc 

153721 gagacagaaa taaccttggt attggtcaga actcgcacgc ctttactcgt gagaagagac 

153781 gtcgccgtcc cgctcgcgga ttctttcaag acggggagaa tgcgcggcgt tgaggaaatc 

153841 aaagtgatat ccttcttctc tttggaagaa gcagaagagg tgaggtcggc aagttcaccg 

153901 gcgagttcta ccccgatggg gccggcgcca atgatggtga tctttttgga ggtggagatg 

153961 gtgtgttgcg cggattggat tgtctctgtt attgtggatg tggaggtcgg ggagagtttg 

154021 aaggggtaga tcacttcttt cctatctgca tctttgtttt cggtttcggt ttcaccgggg 

154081 aatagtggtg atgaggatga tgcggagggg gtgcttcctg atgcgatgac gaggtagtcg 

154141 tatttgattt ctctcagggc cttgttggtg ttgttggtgg tggattcctc gacctggagg 

154201 agtttgttct cgaagtcgag agacgtagct cttcctagga tgaattcgaa tacctcgggg 

154261 gagtattttg tgaacccgtc ggcgatcggg atgaggtact gtgattcgga gaaggcagtt 

154321 ggcttggcga ggatgcgcgg ggcggcgatg gcccagtaga agttgggggt ggggttgatg 

154381 aggaggattt ttattttgat gcctttggat ttgttggtgg tgaagtggga gaggagggag 

154441 tgggcggtgg tgagaccggc gaaggaggcg ccgattatgg ggattgttat tgttttgtcg 

154501 gtggtcatgg tgggtagatg gtgagtggtt gtttgttatg gaagggatgg aagttaagta 

154561 ggattaaagt agataaagta ggtgtagatt aattagacta gtgctatcgc tagtggattc 

154621 ttgacagtct tatactttgc tcagtgagtt tgtcagacat tcccgctgct tgataagcgc 

154681 agtgccgatc tgcactacta tcgatgtgct gcagacggag ttaactattt tcttactttc 

154741 acttgaatgg atccttcatc acaagtaatt agttacttaa taacctggtt atatcccctg 

154801 cagctcgtcc agtagctgca ctgtctcggc tggacttgca atccacgctg tcgtcgatgc 

154861 cagtcgcgcc tatcattatt agcatctcgc catatttgtc cttttagagt attagcaaac 

154921 cttgaaatca ttcgctcccg ccgacaaaaa ctgatcgccg acatgcagcg tccgcgccgg 

154981 ttcaatccct ccaaagtagc gctgacacgc gcgcacaccc cacgacttgt cgccgatgtc 



155041 cacgaagaca tcattgccac ctgattagcc atcagcatga ttcacagtgg taccaaaatg 

155101 agaggaggaa aaggaatacc gttaaaggcg caaaacggca atctggctcc cacctccgac 

155161 cgctcgaccg tattctgcac caccaacacc gtctcctcca gctgctcgcg actaatcctc 

155221 cctctctccg tgggatacac tccgaccgcg cgatccttgc gcagcaccgc gaccggcaga 

155281 ttcagattac tcgcacacgc gcgcaacgat gactcggcga tatccaacag ccgcgtgata 

155341 tccccctcct gccacgtctt catctcctcc aataaccact cctcacgagg gacatagctc 

155401 aacctcgacg gcgaggcatg atcataccgg aagaggaaat tactctctcc gcccatgacg 

155461 accagcccgg cgcgctgagc atccgtcatc tctgcggact cgtacatcgc atcgagcagt 

155521 cccttgagtc gctcgtagta cttcgcggcg tcggtgtagc ccgctgccgt cacgatgccg 

155581 actttccgtc cctgcaggag gaggcgcatg atgcgtagaa tcacggggtt gtcgtccgtg 

155641 agacatgcgc cgtcgtcgta cagagtgacg tctccatcga aggtgacgag gtcgacgccg 

155701 gtggtgcggg ccagaccgat gagttgggcc gagttgagaa tgagccggat gtcgttgaaa 

155761 gaaggggcca cgaagcgtcg tcggctgatg aagcgttggg cgtcttgccg gcggaaggcg 

155821 tcgacgaggt ataagggggt gaagaaggtg cctacggtgg ggacaaggag tttgagtttg 

155881 gatttaccgg gggcgtcgtc ttgttggtgg gttactgtgt ttgtcagttc atgtgtgtct 

155941 aattataaag aaagaatggg agaggccgag ggagagctac tgtggtcgcg gatgagcatc 

156001 tccacgtcgc ggaagatctc cgcatagcgc tggtgcgtgt cggccgctac ggcgatcagg 

156061 ttctcactgt gttcctggta gacggcggtg ggctgcgagt gcagcacaaa ggggacggcc 

156121 aggaggccct acattgttta gtatgtagta tatagtgctg gttggtgagg tgccatactt 

156181 taatccattc aatctgggaa cttgttagct aagtcaacag ctactgaggg aatagggaat 

156241 taccaattgg tctcgacggt gagactgcag gagttagcat ggactggcag cagagatgac 

156301 tgcctacctt gagagcatct gaaggattag ataaatgact caattggatt gaccaggcaa 

156361 ttcgacttac attcaacgcg gtagcgggta gtcatcgtga ggggaagcta aagctggaag 

156421 ctgtcctttg ggatcggcgg tgagggagtc aatggcacaa caacgaacga gggaggatcg 

156481 acaaggtatt tataaagaca atcaattaat gagtgagaac gaaacagatt gaagaataaa 

156541 agaaaacagc tatgcaagga gagagcgaga aaggagaagc acagtcgaga tagggatgac 

156601 acagtgcttg tcatctgacc gggccagcgg ggctgccagc caaccacatg cagcatcgga 

156661 gctctccgac acgccagttg gcgggctgag atgcacattg ggatgcacag gctgtcagct 

156721 attgatctat cacagggaat ccatttggtc aatctgtatc tttattgcca ttggattcaa

156781 taaaacacca ggaaggagat agctaagtaa catagtcaag caacagttac acccgctcgc 

156841 gatgctccgc ccctccaaat ccatcctcca cctgctgttg tttatccgcc accacctggt 

156901 cctgctggtc ggcatcacgg ttgccatgcc gtccgatttg ataccagggc aatgcaaaca 

156961 accgatccat actctctaga ctccgccccg ctgtctcagg cacactgaac cacacccaca 

157021 gcccacccag caccgtcacg gcagagaaga accagaacgt gcccttgggg gtgatgccac 

157081 cctccgcagt acccagcaac atattgggca ccgctcggga attaccatac tgattgacaa 

157141 agtgcagcgt catagcagca gaggtagcca gtgcgcggat gcgtagcggg aacagttctg 

157201 cggtcagcag atactgcatg gagttccagc ccaatgccca gcccacaccg gagatataga 

157261 tcatagcaat ggctccgcgg ctactaccct tctttgactc cggcaggaca aagtcatcga 

157321 caatgcccat ctccggcacg gcagtgagga agctagccac gtagatcatc gagatggcct 

157381 gcagcgtgat tcctaggagt agggcacgtt tgcgcccaat cacatccacc aggaacagag 

157441 cgcaaatcac agctgcgacc agcttgatga taccgaacac tgctgtcaca agtagggact 

157501 cgttggaccc cgtgatgccc aggagactga acagatccgg cgcgtacagc gtgatggatc 

157561 ccgctccaga ccattgggac agtagctgcg ccatggcggc taggtacagc cggtacaggt 

157621 tgctcgggac catgatggcc tccttgacaa tgcccagcca gccagctccc atggtagcct 

157681 ccatctccgc ctgatgggca ttctggatac ccgtgaactc ctcgacaacg tattcatgct 

157741 ctggaggaag attgcggagg cgcgcaaggt tcttcagtgc ttcttcgggc ttgcctttct 

157801 ttacaaggaa gcgaggagac tcgtattgga ggaaggagag gaggaagatg aggccggcaa 

157861 acatgatgtg gagactagtt gggacttcct aggtagggtc agtacattgc catggagaag 

157921 gtgaatgaat gccttaccca tcgcttgtga gtgtggtcgc ccatatggat ctggcagccg 

157981 tagttggtga agtaggcgag cacaatgccc aggtaaacga agccggtgaa gacacaagtg 

158041 cagagcccgc ggacggatgc tggggcctgt ttttggttaa tcaagcccga cagggaattg

158101 aacttgtcac tcacgatctc agccaggtag actggtccca caactacagt ctggccgacg 

158161 cccagtccgg ccacgaagcg accagcatag acagccccca ggctaccgtt ggcaccgacg 

158221 cccatgaaaa tggcaattcc caggatccaa agaacgcaga gttggcgagt cgcccacagc 

158281 cgtccaatac ggtcgcacac aaggaatgca ctataacagt gtcagcagac agacttcacg 

158341 gttcattttg gtgaatggcg tacaagagtg cccctcccac ggaaccaatt tgcaccatag 

158401 cggacacatt cgccttgatg ttggtctgtt caacctcact gtacgagtcc agatgaatag 

158461 tgcgctgaaa atccttggag ttaaaggcac ccgagatcaa accctcgtca acaccacggg 

158521 cagcgcccat gaggccaaag gagagcacgg caaaccatag cctccagttg aagacctcct 

158581 tggggtcagc gagattgcgc aagcgaaaaa tgttaactgg tttcttggaa cctgccatga 

158641 ttgcacacgg tcaatcccag aaaagttgga cgagccacta caggttttct ttttaaacag 

158701 ctggcatctg caagggagga ggggcattta tacccagtca cccccatcag agctacgtcg 



158761 gttaccgaca gccagtcgga acgtcaccta ttcatggtga ggtaggacaa actagtccgg 

158821 gggagaattc actacgaaaa atgtgatcat gcggggatcg gggagagagc agccaatgga 

158881 tcgcgcaggg aacgatatcc accagcaaga catcccacgt tggaatccga cttgtcgcga 

158941 cgagcttatc accgcttgac atgctggaca gcttggcgag tagcaaagcg acaaaattcc

159001 gaagatgcgc tgggtttgtt gtgggacgga gcgtcttggc tgcgggggac aatccgggga 

159061 tcgagcctct tcgtggcatc ccagacccgg tcctgaaagt ccaggtggga cagaccgaca 

159121 atctggggac gaggctcgga caacttccgc ttaggactaa cggaagcagg ccatcgagat 

159181 gcgagtgaga tgcttcgggg taggcttggc aggcgggaga ccggagacgg tctcagcact 

159241 ggcaggcacg ggatatggct tccaggagag gtcgagatgg gagccacgcg gcgtgagatg 

159301 ttggggtcat gcgggggtca gcccaagggg ggggtgtccc tcgcgcaggg tctaaataac 

159361 actaagaggg cactaagaaa gcaaacatgg tggttggctt gtcctgctgg tcggtcaaat 

159421 ccgacgagtg ggggatacat gacgtttgct gggcacgatc gatcagctgt ctctcaatcc 

159481 ggggtgcagc aacaaatgcc tcggtcgcca gcagttttgg gttaaagatc gacgggttcc 

159541 agagctgtta tttagtaatg atagcaagac aatctctcta ttccactggg accggattgc 

159601 gccctaaccc aaccctaccc cacccctcgg gtgagatcct tcggatcagc accattatcc 

159661 ccgtgcacat tctaccatcg taaaaattac ttatatgggg gtttgaagaa agagtccatg 

159721 ataattattc agatagtgca acgttgcaga catatttatt tgatctgagt ttgagattgc 

159781 ggcatcgaga ccagaacgga gtctggctga ctgcaggtgg cgaggtgtta tcatggctac 

159841 atcatccatc ttgcttgctt gatccaggaa ccaaaccagt taccgacgca ggggttcacc 

159901 cctataaaat tcatcccagt aggccatatc actgtaaaac ccatcagtag gaagtaggaa 

159961 atagaacagg tagaatgtcc tgcctcatcc aagattccgt cttcggccaa tttatccgcc 

160021 tcgtttcgcg cggccggtac ttccggttcg aggagcaaaa agacccctcg ctatggcagc 

160081 agtacatcaa cctcgacaaa tcgggccgca tggcctacac tggtcgtccc gatgccgatg 

160141 aggtaccgga gcctgagcgc ttcagcgcag ccgacaccga gtccatgacg agagtgccct 

160201 cgacggcgtg gtcacataat ggccgccgct atcaaggaat caccggagcc gccattgacc 

160261 cggaaaaggg acgcgacgtg accatcatcg actggtggga tgatcatgat ccggaggtat 

160321 gctatttact ttactattct cttctggaca gaagctaact catcagaacc ctcaacactg 

160381 gccgctatgg aagaaagttg tggtgacctt cgagatctgc ctgctcactt tcagcgtgta 

160441 catcggtagc gccatctaca gtgctggtat cgagtcagtg atgagcgaat ttcagatcag 

160501 ccaagtcgca gctaccctag gactgaccct gttcgtggct ggctacggac tgggcccgct 

160561 tctctggtcg cccatgagcg aggtgcccca gatcggtcgc aactccgtct acatcgcgac 

160621 actaattgtt tttgttgcct tgcaagtccc agctgcattg gccgggaacc tcggcactct 

160681 actagccttt cgcttcctga ccggcttctt tggctcgcct gctctcgcaa cggtaccttt 

160741 acccttcccc tcacttaccc ttgtccacta acaacatcta gggcggcgcc agcctcgccg 

160801 acatgttcgc tcctcgcaaa cgcgcctacg gcatcggcat ctggggcatc agcgccatct 

160861 gcggccccgt gctcggtccc ctagtcggcg gatttgccgc ccaagccaaa ggctggcgct 

160921 ggacgatctg ggaactcatg tggctatcag gattcacact agttgtactt ataatcttcc 

160981 tcccagagac ctcctccagt aacattctgt accgacgcgc ccgccgtctc cgcaaactga 

161041 ccaaccggtc caacttgcgc agcgagcccg aactcgccag cgaacacctc acagcgcgag 

161101 aactcgccat gatgacgctc gtccggccct tcacgctcaa cttcaccgag cccatggtct 

161161 tcttactgaa cctatacatc gccctgattt acggtctact ctacgtatgg ttcgaatcct 

161221 tcaccatcgt attctcaggt atctacggct tcaacctcgg ccaacagggt ctcgcctaca 

161281 tcggcatcct aaccggtgcc ctgatcacca ttcctccata ttactggtgg atgcacaaat 

161341 acctagaacc gaaattcgac cccgagacag gaaatgttcc cccagaggcg cgtctccccc 

161401 ccgctatcgt gggaggcttc ttcatcccta tctgtctgtt ttggtttggg tggagtgccc 

161461 gtccctctat tcattggatc atgccgattg tcgggagtgg attcttctcc gtcggtgcgt 

161521 tcttgctctt caaccccgtg ctgaattatc tctccgatgc gtacccggag tatgcggcta 

161581 gtgttctcgc tgggaatgat ttgtttcgga gtgcctttgg tgcagggttt ccgctttttg 

161641 cgacggcgat gtataagaat ctgggggtgg attgggcgag ctcgacgttg gcgtttttgg 

161701 ggtgtgcgtt tattcccatt ccggttgtgt tgatgaaggt gagttttctt tgggcccctt 

161761 ggtagaaata aatggctaat tgttgtggtg tgtagtatgg ggggacgttg aggaagaagt 

161821 atagtcggca tgcgaggaaa gatatttagg ttggttaatt tctttatttc cggggtagag 

161881 ggatagataa atttccttgg tgggcttaca tggcactgct agaatacagt gggtatcata 

161941 tctgtattat tattttgtat catatttcac tatacctgca catctctgcc atggattttt 

162001 cctcgaggca tatgatgcgc tgggatcctt taagtgtatt gggaacctag atatcacgag 

162061 accgaatttg gataagaaca gtctttcttc tgaaccctaa cagtattagc tttgagcgtc 

162121 gagtgccaaa ttcggttatc tgctgctgtt tgcactggcg tagtgtgata tgttgagcga 

162181 tatgctttct tcagacggcg agatatacgt ttcttgtaga gctcagtcgg gtttgatagc 

162241 aaccaagccc gtcatcttca ggaacatttc acctaataca gtcagtacca accataatga 

162301 acaaggtcca gaaagactac ctccatggta ttagcacaag acaatcgtat gatcagctca 

162361 tccaattcct tgtagctaca ccagagaaat taatgtggtg aagtgatact gacagtatct 

162421 ttcggcggag gggattttac gcgacaatca tctagcaaaa tatagggatt cgtggcaact 



162481 gcctagggtt gagcagtaag gtggcccttt aaagggccgg ggtgacagtt catatagata 

162541 gtcaacttca aatgtaaccg aagggagaaa gtatattaat ctcacaataa gtggggttga 

162601 cctggcgaac caactgagtc tatatgctaa tatggtccaa tattgggata gttgagaatt 

162661 tttccggata aaaagcagca gccactcaag gacgtatgga atcggtatcc gtacaccgac 

162721 caaggaagac tcgactctcg tcagtgtaca accttaccat ctcatgaccg cttgcttgca 

162781 cagtctcgct tccctgcaat gactccatac cctcactttg atcccgatac aagaacagca 

162841 atggcagcgt atcatttctc caagcgggca cacgagcgcg gcgagtcttc caaagcgtgc 

162901 cgaacaccag gatagcccct aaagtccaga caagaacggg caaggccaga taagcccagt 

162961 acactacgat atagacctca ctgactttcg ccctacccgt cgtcatggca gccgatgcgg 

163021 gattgacgac gtaagccaca tctcggaagg tttttgacat cgccgcagcc acgttgtcca 

163081 tcgcgcagga cagtcgatcg gaaaggatct cactgcaccc agtaatattt gccacggcca 

163141 gtgcctggag cgtgtcaaaa ccagcatacc agcctttatc cagcgggtcg aaatatattt 

163201 ccgatccgga cctagtcgcc tgacccgaga aaatggcggt taggaaggga tagatccact 

163261 gtttggcatc accactaagt atgaaagatt gcggggcttt gtgcacacca agatcggaac 

163321 cccaagagtc aggggtgaga ttcaatcctg cagactgatg gacattatta gtggaagtcc 

163381 aattatactg aaaccagatc gcccgcgtct tatcactgta gttgttctgg tagactgtag 

163441 gacgaaagct gcgcacgatg ggctccagag aacactctgt agcttcccac tgtgtgccat 

163501 tcgtcatagg ggaatcgtcg agcagctcca ggttggcccg aggggccatg tattgaattg 

163561 caatcaatgt agcattggta taaaccaccg catcataagt ggagactggg gtcattgaaa 

163621 aggtgctaaa tgagtaatcg tctggcatat accaggccga gagaccgctt ttcggtagtg 

LOCUS (LOC) : 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 
SOURCE : 
ORGANISM (ORGN) : 



REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 



AM270325 GenBank (R) 

AM270325 

AM270325.1 GI:134081809 

928583-76-0 

226503 

DNA; linear 

Plants, fungi, algae 

24 Mar 2007 

Aspergillus niger contig An14c0180, complete genome. 
Aspergillus niger 
Aspergillus niger 

Eukaryota; Fungi; Ascomycota; Pezizomycotina; 
Eurotiomycetes; Eurotiales; Trichocomaceae; mitosporic 
Trichocomaceae; Aspergillus 

1 (bases 23835 to 25756) 
Jones, S.W.; Luk,K.C. 

Isolation of a chicken thioredoxin cDNA clone. 
Thioredoxin mRNA is differentially expressed in normal 
and Rous sarcoma virus-transformed chicken embryo 

fibroblasts 

J. Biol. Chem., 263 (20), 9607-9611 (1988) 

CA 109:105752 

2 (bases 16557 to 19301) 

Robinson,J.S.; Klionsky,D.J.; Banta,L.M.; Emr,S.D. 

Protein sorting in Saccharomyces cerevisiae: isolation 

of mutants defective in the delivery and processing of 

multiple vacuolar hydrolases 

Mol. Cell. Biol., 8 (11), 4936-4948 (1988) 

CA 110:20954 

3 (bases 181479 to 182990) 

Jackson,B.J.; Warren,C.D.; Bugge,B.; Robbins,P.W. 

Synthesis of lipid-linked oligosaccharides in 
Saccharomyces cerevisiae: Man2GlcNAc2 and Man1GlcNAc2 
are transferred from dolichol to protein in vivo 
Arch. Biochem. Biophys., 272 (1), 203-209 (1989) 
CA 111:74550 

4 (bases 197258 to 198879) 
Szumanski,M.B.; Boyle,S.M. 

Analysis and sequence of the speB gene encoding 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 
JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 



agmatine ureohydrolase, a putrescine biosynthetic 

enzyme in Escherichia coli 

J. Bacteriol., 172 (2), 538-547 (1990) 

CA 113:18542 

5 (bases 8900 to 10226) 

Lambert,S.; Yu,H.; Prchal,J.T.; Lawler,J.; Ruff,P.; 

Speicher,D.; Cheung,M.C.; Kan,Y.W.; Palek,J. 

cDNA sequence for human erythrocyte ankyrin 

Proc. Natl. Acad. Sci. U.S.A., 87 (5), 1730-1734 (1990) 

CA 113:146283 

6 (bases 149275 to 150765) 

Chen,W.N.; Balzi,E.; Capieaux,E.; Choder,M.; Goffeau,A. 
The DNA sequencing of the 17 kb HindIII fragment 
spanning the LEU1 and ATE1 loci on chromosome VII from 
Saccharomyces cerevisiae reveals the PDR6 gene, a new 
member of the genetic network controlling pleiotropic 
drug resistance 
Yeast, 7 (3), 287-299 (1991) 
CA 116:16482 

7 (bases 20272 to 22263) 
Tobias,J.W.; Varshavsky,A. 

Cloning and functional analysis of the 
ubiquitin-specific protease gene UBP1 of Saccharomyces 
cerevisiae 

J. Biol. Chem., 266 (18), 12021-12028 (1991) 
CA 116:122304 

8 (bases 92137 to 93393) 

Gosalbes,M.J.; Perez-Gonzalez,J.A.; Gonzalez,R.; 
Navarro,A. 

Two beta-glycanase genes are clustered in Bacillus 

polymyxa: molecular cloning, expression, and sequence 

analysis of genes encoding a xylanase and an 

endo-beta-(1,3)-(1,4)-glucanase 

J. Bacteriol., 173 (23), 7705-7710 (1991) 

CA 118:33731 

9 (bases 159715 to 163074) 

Kumar,V.; Ramakrishnan,S.; Teeri,T.T.; Knowles,J.K.; 
Hartley,B.S. 

Saccharomyces cerevisiae cells secreting an Aspergillus 
niger beta-galactosidase grow on whey permeate 
Biotechnology (N.Y.), 10 (1), 82-85 (1992) 

10 (bases 197258 to 198879) 
Szumanski,M.B.; Boyle,S.M. 

Influence of cyclic AMP, agmatine, and a novel protein 

encoded by a flanking gene on speB (agmatine 

ureohydrolase) in Escherichia coli 

J. Bacteriol., 174 (3), 758-764 (1992) 

CA 119:21634 

11 (bases 16557 to 19301) 
Wada,Y.; Ohsumi,Y.; Anraku,Y. 

Genes for directing vacuolar morphogenesis in 
Saccharomyces cerevisiae. I. Isolation and 
characterization of two classes of vam mutants 
J. Biol. Chem., 267 (26), 18665-18670 (1992) 
CA 117:105596 

12 (bases 20272 to 22263) 

Baker,R.T.; Tobias,J.W.; Varshavsky,A. 

Ubiquitin-specific proteases of Saccharomyces 

cerevisiae. Cloning of UBP2 and UBP3, and functional 

analysis of the UBP gene family 

J. Biol. Chem., 267 (32), 23364-23375 (1992) 

CA 119:23561 



REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 

TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 



TITLE (TI): 



13 (bases 16557 to 19301) 
Horazdovsky,B.F.; Emr,S.D. 

The VPS16 gene product associates with a 

sedimentable protein complex and is essential for 

vacuolar protein sorting in yeast 

J. Biol. Chem., 268 (7), 4953-4962 (1993) 

CA 118:229898 

14 (bases 32776 to 34762) 

Roebroek,A.J.; Creemers,J.W.; Pauli,I.G.; Bogaert,T.; 
Van de Ven,W.J. 

Generation of structural and functional diversity in 
furin-like proteins in Drosophila melanogaster by 
alternative splicing of the Dfur1 gene 
EMBO J., 12 (5), 1853-1870 (1993) 
CA 120:70454 

15 (bases 181479 to 182990) 

Jackson,B.J.; Kukuruzinska,M.A.; Robbins,P. 
Biosynthesis of asparagine-linked oligosaccharides in 
Saccharomyces cerevisiae: the alg2 mutation 
Glycobiology, 3 (4), 357-364 (1993) 
CA 120:126513 

16 (bases 22652 to 23365) 

Errabolu,R.; Sanders,M.A.; Salisbury,J.L. 
Nagarajan,V. 

Genetic analysis of a gene cluster for cyclohexanol 
oxidation in Acinetobacter sp. Strain SE19 by in vitro 

transposition 

J. Bacteriol., 182 (17), 4744-4751 (2000) 

CA 134:37832 

73 (bases 163244 to 165037) 
Saupe, S.J. 

Molecular genetics of heterokaryon incompatibility in 
filamentous ascomycetes 

Microbiol. Mol. Biol. Rev., 64 (3), 489-502 (2000) 
CA 134:27321 

74 (bases 128987 to 130729) 
Tanaka,A.; Tsuge,T. 
JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 
JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 



Structural and functional complexity of the genomic 
region controlling AK-toxin biosynthesis and 
pathogenicity in the Japanese pear pathotype of 
Alternaria alternata 

Mol. Plant Microbe Interact., 13 (9), 975-986 (2000) 

CA 134:96037 

75 (bases 143978 to 145355) 
Stintzi,A.; Browse, J. 

The Arabidopsis male-sterile mutant, opr3, lacks the 
12-oxophytodienoic acid reductase required for 
jasmonate synthesis 

Proc. Natl. Acad. Sci. U.S.A., 97 (19), 10625-10630 
(2000) 

CA 133:293530 

76 (bases 54264 to 55012) 

Amerik,A.Y.; Nowak,J.; Swaminathan,S.; Hochstrasser,M. 
The Doa4 deubiquitinating enzyme is functionally linked 
to the vacuolar protein-sorting and endocytic pathways 
Mol. Biol. Cell, 11 (10), 3365-3380 (2000) 
CA 134:82296 

77 (bases 208451 to 210658) 
Tieu,Q.; Nunnari,J. 

Mdv1p is a WD repeat protein that interacts with the 
dynamin-related GTPase, Dnm1p, to trigger mitochondrial 
division 

J. Cell Biol., 151 (2), 353-366 (2000) 

CA 134:52741 

78 (bases 208451 to 210658) 
Mozdy,A.D.; McCaffery,J.M.; Shaw,J.M. 

Dnm1p GTPase-mediated mitochondrial fission is a 

multi-step process requiring the novel integral 

membrane component Fis1p 

J. Cell Biol., 151 (2), 367-380 (2000) 

CA 134:52934 

79 (bases 94126 to 97982) 

Kadoya,T.; Kishida,S.; Fukui,A.; Hinoi,T.; Michiue,T.; 
Kikuchi, A. 

Inhibition of Wnt signaling pathway by a novel 
axin-binding protein 

J. Biol. Chem., 275 (47), 37030-37037 (2000) 
CA 134:233219 

80 (bases 80610 to 81592) 

Ui,S.; Takusagawa,Y.; Ohtsuki,T.; Mimura,A.; Ohkuma,M.; 
Kudo,T. 

Stereochemical applications of the expression of the 
L-2,3-butanediol dehydrogenase gene in Escherichia coli 
Lett. Appl. Microbiol., 32 (2), 93-98 (2001) 
CA 135:353434 

81 (bases 80610 to 81592) 

Otagiri,M.; Kurisu,G.; Ui,S.; Takusagawa,Y.; Ohkuma,M.; 
Kudo,T.; Kusunoki,M. 

Crystal structure of meso-2,3-butanediol dehydrogenase 
in a complex with NAD+ and inhibitor mercaptoethanol at 
1.7 A resolution for understanding of chiral substrate 

recognition mechanisms 

J. Biochem., 129 (2), 205-208 (2001) 

CA 135:2180 

82 (bases 41237 to 43709) 

Davidson,E.; Caffarella,J.; Vitseva,O.; Hou,Y.M.; 
King,M.P. 

Isolation of two cDNAs encoding functional human 
cytoplasmic cysteinyl-tRNA synthetase 
AUTHOR (AU) : 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 



JOURNAL (SO) : 

REFERENCE : 

AUTHOR (AU) : 
TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE : 

AUTHOR (AU) : 



TITLE (TI) : 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 
JOURNAL (SO) : 



Biol. Chem., 382 (3), 399-406 (2001) 
CA 135:118589 

83 (bases 119272 to 121189) 

Cukovic,D.; Ehlting,J.; VanZiffle,J.A.; Douglas,C.J. 
Structure and evolution of 4-coumarate:coenzyme A 
ligase (4CL) gene families 
Biol. Chem., 382 (4), 645-654 (2001) 

CA 136:145906 

84 (bases 87776 to 89339) 

Gassama-Diagne,A.; Hullin-Matsuda,F.; Li,R.Y.; 
Nauze,M.; Ragab,A.; Pons,V.; Delagebeaudeuf,C.; 
Simon,M.F.; Fauvel,J.; Chap,H. 

Enterophilins, a new family of leucine zipper proteins 
bearing a b30.2 domain and associated with enterocyte 

differentiation 

J. Biol. Chem., 276 (21), 18352-18360 (2001) 
CA 136:114413 

85 (bases 80610 to 81592) 

Otagiri,M.; Kurisu,G.; Swaminathan,S.; Ui,S.; 
Yoneda,S.; Ohkuma,M.; Kudo,T.; Kusunoki,M. 
Crystallization and preliminary X-ray studies of 
meso-2,3-butanediol dehydrogenase from Klebsiella 
pneumoniae IAM1063 

Acta Crystallogr. D Biol. Crystallogr., 57 (PT 6), 
857-859 (2001) 

86 (bases 51277 to 53277) 

Zenklusen,D.; Vinciguerra,P.; Strahm,Y.; Stutz,F. 
The yeast hnRNP-Like proteins Yra1p and Yra2p 
participate in mRNA export through interaction with 
Mex67p 

Mol. Cell. Biol., 21 (13), 4219-4232 (2001) 
CA 135:177841 

87 (bases 1 to 226503) 
Pel,H.J.; de Winde,J.H.; Archer,D.B.; Dyer,P.S.; 
Hofmann,G.; Schaap,P.J.; Turner,G.; de Vries,R.P.; 
Albang,R.; Albermann,K.; Andersen,M.R.; Bendtsen,J.D.; 
Benen,J.A.; van den Berg,M.; Breestraat,S.; 
Caddick,M.X.; Contreras,R.; Cornell,M.; Coutinho,P.M.; 
Danchin,E.G.; Debets,A.J.; Dekker,P.; van Dijck,P.W.; 
van Dijk,A.; Dijkhuizen,L.; Driessen,A.J.; d'Enfert,C.; 
Geysens,S.; Goosen,C.; Groot,G.S.; de Groot,P.W.; 
Guillemette,T.; Henrissat,B.; Herweijer,M.; van den 
Hombergh,J.P.; van den Hondel,C.A.; van der 
Heijden,R.T.; van der Kaaij,R.M.; Klis,F.M.; 
Kools,H.J.; Kubicek,C.P.; van Kuyk,P.A.; Lauber,J.; 
Lu,X.; van der Maarel,M.J.; Meulenberg,R.; Menke,H.; 
Mortimer,M.A.; Nielsen,J.; Oliver,S.G.; Olsthoorn,M.; 
Pal,K.; van Peij,N.N.; Ram,A.F.; Rinas,U.; Roubos,J.A.; 
Sagt,C.M.; Schmoll,M.; Sun,J.; Ussery,D.; Varga,J.; 
Vervecken,W.; van de Vondervoort,P.J.; Wedler,H.; 
Wosten,H.A.; Zeng,A.P.; van Ooyen,A.J.; Visser,J.; 
Stam,H. 

Genome sequencing and analysis of the versatile cell 

factory Aspergillus niger CBS 513.88 

Nat. Biotechnol., 25 (2), 221-231 (2007) 

CA 146:310276 

88 (bases 1 to 226503) 

Pel, H.J. 

Direct Submission 

Submitted (01-MAY-2006) Pel H.J., DSM, 624-0295, P.O. 
Box 1, 2600 MA Delft, THE NETHERLANDS 



FEATURES (FEAT) : 
Feature Key 



exon 

intron 

exon 

gene 
mRNA 

CDS 



<1150..>1491 
join(<1150..1323, 
1420..>1491) 
join(1150..1323, 
1420..1491) 



1150..1323 

1324..1419 

1420..1491 

<2852..>4742 
join(<2852..3927, 
3989..4085,4149..>4742) 
join(2852..3927, 
3989..4085,4149..4742) 



Qualifier 

/organism="Aspergillus niger" 
/mol-type="genomic DNA" 
/db-xref="taxon:5061" 
/clone="An14c0180" 
/locus-tag="An14g05040" 
/locus-tag="An14g05040" 

/locus-tag="An14g05040" 

/codon-start=1 

/product="hypothetical protein" 
/protein-id="CAK42065.1" 
/db-xref="GI:134081810" 
/translation="MSLSFSRCPAKLNVNGQRKS 
AEEPTRSPSIARLGIDSSFLLNGP 
MWIAQDRSAPFRASAMECWGKGEAGAWVATVRQR 
LLE" 

/locus-tag="An14g05040" 
/number=1 

/locus-tag="An14g05040" 
/number=1 

/locus-tag="An14g05040" 
/number=2 

/locus-tag="An14g05050" 
/locus-tag="An14g05050" 

/locus-tag="An14g05050" 

/inference="profile:COGS:COG1716" 
/inference="profile:PFAM:PF00498" 
/inference="similar to AA 
sequence:PIR:T37862" 
/note="unnamed protein product; 
Function: dma1 of S. pombe is a 
component of the spindle assembly 
checkpoint, required to prevent 
septum formation and premature 
exit from mitosis if spindle 
function is impaired. Similarity: 
the predicted ORF contains an FHA 
(Forkhead-associated) domain, 
which is a phosphopeptide binding 
motif; the FHA domain is a 
putative nuclear signalling domain 
found in a variety of otherwise 
unrelated proteins. Similarity: 
the similarity to dma1 of S. pombe 
is limited to the C-terminal half 
of the predicted ORF; the 
C-terminal half is very rich of 
low-complexity, serine-containing 
repeats, producing many alignments 
with proteophosphoglycans. Title: 
strong similarity to defective in 
mitotic arrest dma1p - 
Schizosaccharomyces pombe" 
/citation=[33] 
/codon-start=1 
/protein-id="CAK42066.1" 



exon 

intron 

exon 

intron 

exon 

gene 

mRNA 

CDS 



exon 

intron 

exon 

gene 

mRNA 

CDS 



2852..3927 
3928..3988 
3989..4085 
4086..4148 
4149..4742 

complement(<5413..>5620 
) 

complement(join(<5413.. 
5548,5604..>5620)) 
complement(join(5413..5 
548,5604..5620)) 



complement(5413..5548) 
complement(5549..5603) 
complement(5604..5620) 

complement(<6405..>7365 

) 

complement(join(<6405.. 
6781,6840..7160, 
7215..>7365)) 
complement(join(6405..6 
781,6840..7160, 
7215..7365)) 



/db-xref="GI:134081811" 

/translation="MSVSASVSASASSNLPSDSR 

SSGPLRRLSQLRAYTQQHFSSSSS 

ATASSSHPSASSSSSNGPRISRRHTLSSRVSWFS 

PASTGTESSQLSPVGSRSSGPCPE 

SEQPSTLARYSAVFFPSYRGLELDLRSSRQSDLS 

SSSPSNHSGGRETTTQGTTMARPR 

GASQALDARPDLDATTPSASNLHAGALDSSDPVM 

SENPSTASPRPKQKATIRFFPHQD 

SHQSSRPSLPFIPVSRTLPSESCVIRVGRYSERD 

GLPVANPTDPSDAPVGFKSKWSR 

KHCEFLYLNGQWHIKDVGSSSGTFLNHMRLSQPN 

MPSRLYTVKDGDIVQLGIDFRGGE 

EMIFRCVRIRIECNRSWQQQPNEFNKNTESLIRN 

LGKGDTADYSGCRECSICLGSVLR 

PYQCLFMAACAHVWHYKCVSRLIHTPDYPMFQCP 

NCRAYTDLSAEVDDTNDFDEEDEK 

KDTPEDKQDTTDASRSQTNSPQLEAQPAPSGDEP 

RQDNLPAEAGLAANIESMRLQDTD 

ASDDARRPRAPTSNNDGASANADNEVPGWQSVTQ 

SPNALQTRQSHLRADTPVRSESSD 

DNPLTPLNDSGPLALDGRAAMP" 

/locus-tag="An14g05050" 

/number=1 

/locus-tag="An14g05050" 
/number=1 

/locus-tag="An14g05050" 
/number=2 

/locus-tag="An14g05050" 
/number=2 

/locus-tag="An14g05050" 
/number=3 

/locus-tag="An14g05060" 
/locus-tag="An14g05060" 
/locus-tag="An14g05060" 
/codon-start=1 

/product="hypothetical protein" 
/protein-id="CAK42067.1" 

/db-xref="GI:134081812" 
/translation="MSGMVCTIEGDELVSELGGL 
NSPSAKHHECCVFMSPRVMGTRKL ILQEAA" 
/locus-tag="An14g05060" 
/number=1 

/locus-tag="An14g05060" 
/number=1 

/locus-tag="An14g05060" 
/number=2 

/locus-tag="An14g05070" 
/locus-tag="An14g05070" 



/locus-tag="An14g05070" 

/inference="similar to AA 
sequence:UniProtKB:NCX4G11.7" 
/note="unnamed protein product; 
Title: similarity to hypothetical 
protein X4G11.070 - Neurospora 
crassa" 

/codon-start=1 
/protein-id="CAK42068.1" 
/db-xref="GI:134081813" 
/translation="MSGLMHKVKDAVTGHHKESS 
NTNTSNTSSNAGPHSSNMANKMDP 
RVDSDRDNRARHEAMGGAHGPHDSSMANKMDPRV 
DSDRDNRANPSGGLGGSGAGYSSG 
GYSSGPHESGMANKMDPRVDSDRDNRARHQAMGG 
AHGPHDSSMANKMDPRVDSDRDNR 
AAGGYSGTHTTGAGYGGAGAGAGYGGGAGYGGSG 
VNDYTTSSGYGGSSNAGPHGSNMA 
NKMDPRVDSDMDNRARHQGMASSSYNAPGQTTAG 
PHSSNMANKLDPRVDSDLDNRGTL GTQRGF" 
exon complement(6405..6781) /locus-tag="An14g05070" 

/number=1 

intron complement(6782..6839) /locus-tag="An14g05070" 

/number=1 

exon complement(6840..7160) /locus-tag="An14g05070" 

/number=2 

intron complement(7161..7214) /locus-tag="An14g05070" 

/number=2 

exon complement(7215..7365) /locus-tag="An14g05070" 

/number=3 

gene complement(<8900..>1022 /locus-tag="An14g05080" 

6) 

mRNA complement(join(<8900.. /locus-tag="An14g05080" 

9282,9368..9550, 
9634..9898, 
10206..>10226)) 

CDS complement(join(8900..9 /locus-tag="An14g05080" 

282,9368..9550, 
9634..9898, 
10206..10226)) 

/inference="profile:COGS:COG0666" 
/inference="profile:PFAM:PF00023" 
/note="unnamed protein product; 
Function: many ankyrin repeat 
regions are known to function as 
protein-protein interaction 
domains. Remark: closest manual 
BLASTP homologue: 
SWISSPROT:ANK1-HUMAN. Similarity: 
ankyrin repeats (ANK) are tandemly 
repeated modules of about 33 amino 
acids, that occur in a large 
number of functionally diverse 
proteins mainly from eukaryotes. 
Similarity: the predicted ORF 
shows similarity also to the ZU5 
domain present in ZO-1 and 
Unc5-like netrin receptors, which 
function remains unknown. 
Similarity: the similarity to 
ankyrin of H. sapiens and other 
proteins is mainly due to the 
presence of two ankyrin repeats. 
Title: weak similarity to ankyrin 
(variant 2.1) - Homo sapiens" 
/citation=[5] 
/codon-start=1 



/protein-id="CAK42069.1" 
/db-xref="GI:134081814" 
/translation="MSRATFMIRDCTKNFFQSGK 
PTSSNGHTPAIVETLRIELLNGLM 
GPASGSLITVISESGEKTMRRVFSGFQPILAVGN 
PSLANYLVDELRPSSEKTAHRGHE 
SIIRILLDFGVKDLNSRDKSGHTPLLHAASMWHT 
KIVKLLLRTGIPDPNCQNAGAHET 
IMFGHRWPLPVKLVMKRSLPLLETGKMDSSLEDQ 
DGFTPLTYATEYGNERIIKLLLDA 
SDPRAAKTATTELEIFAATRATPELQPVHEGPW 
EATPSELVSRTSPKDIVRLRWRNN PALNPQH" 
exon complement(8900..9282) /locus-tag="An14g05080" 

/number=1 

intron complement(9283..9367) /locus-tag="An14g05080" 

/number=1 

exon complement(9368..9550) /locus-tag="An14g05080" 

/number=2 

intron complement(9551..9633) /locus-tag="An14g05080" 

/number=2 

exon complement(9634..9898) /locus-tag="An14g05080" 

/number=3 

intron complement(9899..10205) /locus-tag="An14g05080" 

/number=3 

exon complement(10206..10226 /locus-tag="An14g05080" 

) 

/number=4 

gene <11182..>11344 /locus-tag="An14g05090" 

mRNA join(<11182..11234, /locus-tag="An14g05090" 

11281..>11344) 

CDS join(11182..11234, /locus-tag="An14g05090" 

11281..11344) 

/note="Similarity: BLASTP 
identifies a stretch of similarity 
to D. melanogaster genomic 
fragment CG6456; there is no 
sufficient reason to consider such 
a similarity significant." 
/codon-start=1 

/product="hypothetical protein" 
/protein-id="CAK42070.1" 

/db-xref="GI:134081815" 
/translation="MAGSPPNSFPGGNGLSEEFE 
GWLPGGKPTNCNETIGSR" 
exon 11182..11234 /locus-tag="An14g05090" 

/number=1 

intron 11235..11280 /locus-tag="An14g05090" 

/number=1 

exon 11281..11344 /locus-tag="An14g05090" 

/number=2 

gene <12054..>13438 /locus-tag="An14g05100" 

mRNA join(<12054..12184, /locus-tag="An14g05100" 

12254..12395, 

12454..12995, 

13054..>13438) 

CDS join(12054..12184, /locus-tag="An14g05100" 

12254..12395, 
12454..12995, 
13054..13438) 

/inference="profile:COGS:COG3781" 
/inference="profile:PFAM:PF05249" 
/inference="similar to AA 



exon 12054..12184 

intron 12185..12253 

exon 12254..12395 

intron 12396..12453 

exon 12454..12995 

intron 12996..13053 

exon 13054..13438 

gene <13882..>14418 

mRNA <13882..>14418 

CDS 13882..14418 



sequence:PIR:AD2179" 

/note="unnamed protein product; 

Title: strong similarity to 

hypothetical protein alr2987 - 

Nostoc sp" 

/codon-start=1 

/protein-id="CAK42071.1" 

/db-xref="GI:134081816" 

/translation="MRLNFSQPPTPALSRQNTTF 

LNNAHHYRAGTHPHLLTSRSKPRR 

WPLVFRFIKGAIHAAILLPVFLHAAFTAFWYLD 

VYVFDTVGLPSSIIPSLSIWGLM 

LVFRNQTSYNRFWDGRNGMNTIYTCIRNLVRTIV 

TNGYSTAGPPTAAEKEDIERTIRI 

LMSIPFAVKNHLRAEWGAAWALGNDVAENGVAAF 

NPDYASLLPAGLVGHEDEGLGLPF 

QLTFFVDGFIKRGVERGWFNAPGASQMQAQLNSL 

MDAYGKMETIKLTPIPVAHLIHQK 

QVLALFGCVLPFGMVDDMGWWTVPMVSLVIFTLY 

GIEGIGSQLEDPFGYDRNDIKMDA 

IVGDAKTEIDWLAEWRRLMASLESATDNGDQHL 

GNGLWEEEVKEPKFMPPDMFLKH RARPSGR" 

/locus-tag="An14g05100" 

/number=1 

/locus-tag="An14g05100" 
/number=1 

/locus-tag="An14g05100" 
/number=2 

/locus-tag="An14g05100" 
/number=2 

/locus-tag="An14g05100" 
/number=3 

/locus-tag="An14g05100" 
/number=3 

/locus-tag="An14g05100" 
/number=4 

/locus-tag="An14g05110" 
/locus-tag="An14g05110" 
/locus-tag="An14g05110" 
/inference="similar to AA 
sequence:UniProtKB:BT004329.1" 
/note="unnamed protein product; 
Similarity: the predicted ORF 
contains an EF-hand, the most 
common domain responsible for 
calcium binding in proteins like 
calmodulin, myosin regulatory 
light chains, parvalbumins, 
troponins C, and neuronal calcium 
sensors. Title: weak similarity to 
hypothetical protein AT4g01140 - 
Arabidopsis thaliana" 
/codon-start=1 
/protein-id="CAK42072.1" 
/db-xref="GI:134081817" 
/translation="MNLLPLLLTLTTFLLPIATA 
FTSTFENCQNLNRLNQTVHRVHPL 
LEKFSSIHASQSQSSTTTTTTLANGTTITVTNTT 
SSAAIEKIEGVKALLLSSQEKIYG 
RLSNCSSDAVSPRNVLEKGVKRDDDDDGSCTLND 
VLDQLVDTLECVLSFATGLLETIL 
DGLFDLLKTIIEGVEKLL" 



sig-peptide 13882..13941 /locus-tag="An14g05110" 

/inference="protein 
motif:SignalP:2.0" 

mat-peptide 13942..14415 /locus-tag="An14g05110" 

/product="unnamed" 

exon 13882..14418 /locus-tag="An14g05110" 

/number=1 

gene complement(<14885..>163 /locus-tag="An14g05120" 

07) 

mRNA complement(join(<14885. /locus-tag="An14g05120" 

.15101,15159..15313, 
15370..15531, 
15585..15698, 
15751..15900, 
15955..16104, 
16161..>16307)) 

CDS complement(join(14885.. /locus-tag="An14g05120" 

15101,15159..15313, 
15370..15531, 
15585..15698, 
15751..15900, 
15955..16104, 
16161..16307)) 

/inference="profile:COGS:COG0714" 
/inference="similar to AA 
sequence:UniProtKB:NC17E5.27" 
/note="unnamed protein product; 
Function: MoxR is a poorly 
characterized bacterial protein 
which may be involved in the 
regulation of formation of active 
methanol dehydrogenase. 
Similarity: the predicted ORF 
shows weak similarity to many 
putative proteins classified as 
MoxR-like ATPases. Title: strong 
similarity to hypothetical 
conserved protein 17E5.290 - 
Neurospora crassa" 
/codon-start=1 
/protein-id="CAK42073.1" 
/db-xref="GI:134081818" 
/translation="MEDGSEISRLAPELSDLEVA 
LFLCLAAHQHCRIDTTDANIHDVA 
KELALICTNTFGLSYSILDCSSATSLDDFRAELL 
PPGVYRSSYARPSRSRLTTESAGM 
ISSYHDLRDHKGRLKSAPIEHKQEWNWIAKNF 
NHVNDNIQREALELMHSKQLTTPA 
GVLEAPNNFLFLPLIVRDETHPINPHLNDYLFIS 
HFHDTEDGYVYLEENNDWLSDGQL 
SASSVIHKPEAQMKKGHPYVDSTLLGQLQQLSTT 
VSMGADIARYQQDIWFLRLSRAV 
AGGISTRSNLYFKSFSKLLAVLHGIDFLTPSIVA 
LAARKVFRHRIIVAKPEEDRSLQY 
GSDLHAVSQVLEYANPDSILDGVLTLEAPL" 
exon complement(14885..15101 /locus-tag="An14g05120" 

) 

/number=1 

intron complement(15102..15158 /locus-tag="An14g05120" 

) 

/number=1 

exon complement(15159..15313 /locus-tag="An14g05120" 



intron 



/number=2 

complement(15314..15369 /locus-tag="An14g05120" 

/number=2 

complement(15370..15531 /locus-tag="An14g05120" 

/number=3 

complement(15532..15584 /locus-tag="An14g05120" 

/number=3 

complement(15585..15698 /locus-tag="An14g05120" 

/number=4 

complement(15699..15750 /locus-tag="An14g05120" 

/number=4 

complement(15751..15900 /locus-tag="An14g05120" 

/number=5 

complement(15901..15954 /locus-tag="An14g05120" 

/number=5 

complement(15955..16104 /locus-tag="An14g05120" 

/number=6 

complement(16105..16160 /locus-tag="An14g05120" 

/number=6 

complement(16161..16307 /locus-tag="An14 

<16557..>19301 
join(<16557..16701, 
16758..16864, 
16910..17038, 
17088..17676, 
17726..18996, 
19053..>19301) 
join(16557..16701, 
16758..16864, 
16910..17038, 
17088..17676, 
17726..18996, 
19053..19301) 



/number=7 

/locus-tag="An14g05130" 
/locus-tag="An14g05130" 

/locus-tag="An14g05130" 

/inference="profile:PFAM:PF04840" 
/inference="profile:PFAM:PF04841" 
/note="unnamed protein product; 
Complex: VPS16 of S. cerevisiae is 
a component of the so called class 
C Vps complex, required for 
vacuolar protein sorting and 
morphology. Function: VPS16 of S. 
cerevisiae is required for 
vacuolar protein sorting and 
vacuolar biogenesis and stability. 
Phenotype: S. cerevisiae VPS16 
mutants have pleiotropic defects 
in vacuolar morphology and 
vacuolar protein targeting. Title: 
strong similarity to vacuolar 
protein sorting-associated protein 
Vps16 - Saccharomyces cerevisiae 
vacuole" 
/citation=[2] 
/citation=[11] 
/citation=[13] 
/citation=[42] 
/codon-start=1 
/protein-id="CAK42074.1" 
/db-xref="GI:134081819" 
/translation="MAPSNPLANWERLGDSFYRK 
VPIYDAIFDDDVELENYIIAGAPY 
GGAIALHRDESKPYRFRDAQTAKSSIDIYSCSGK 
HINRINWEYGTIRGLGWSDKEELL 
VITEDGTVRRYFGLHGDFTSFSLGNGAEDYGVRA 
CRFWTSGFVALLSNNQLVAVSNYD 
EPRPRLLAPCPEGEVSSWSLIPPAYTLSRSVEVL 
LAVDKTVYLVDPTEAEDKVLQNGP 
FKHASVSPTGRFVALITAEGKVWWSSDFQSKYS 
EYDPESRVTPRTVDWCGDDAWIA 
WEDEVHLIGPNGVAARYYYDGTVHWPEFDGVRL 
ITNDTCEFLHKWDVTEAIFRLGS 
TSPASVLLDSIDLLEKKSPKADENIQRIRPSLPE 
AVDTCVKAAGHEFDTYWQKRLLKA 
ASFGKSVLDLYNSDEFVEMTEKLRVLKAVRDYQI 
GLPLSYEQYMRLTPERLIERLVNR 
HEYLLAIRISEYLELPADRIYVHWASQKVKVSTV 
DDDAVCKLIVQRLEGKPGISFELI 
AQTAYDEGRAHLATQLLNHEPRAGKQVPLLLDME 
EDEIALDKAIESGDVDLVNYVLLH 
LKTKLPLASFFRTINTRPMASALVETTARGQDTE 
LLKDLFYQDDRPIDGSNVLLSEAL 
DATDLPRKTEKLQLASRLLSDSKDPSWLQQKLL 
NEASQLLKVQEALDKDLADRSEFL 
GLSLNETIYRLIRSGYGKRAQKIQSEFRMPEKTF 
WWLRLRALVAKRDWGELEEIGKNK 
KSPIGWEPFYNEILGAGNTKLASLFVPKCTNLPA 
EDKMEMWVKCGMIAKAGEEAFRAK 
NVNALELLQARASGPAAVEIERMINQLRPRK" 



exon 16557..16701 /locus-tag="An14g05130"
/number=1

intron 16702..16757 /locus-tag="An14g05130"
/number=1

exon 16758..16864 /locus-tag="An14g05130"
/number=2

intron 16865..16909 /locus-tag="An14g05130"
/number=2

exon 16910..17038 /locus-tag="An14g05130"
/number=3

intron 17039..17087 /locus-tag="An14g05130"
/number=3

exon 17088..17676 /locus-tag="An14g05130"
/number=4

intron 17677..17725 /locus-tag="An14g05130"
/number=4

exon 17726..18996 /locus-tag="An14g05130"
/number=5

intron 18997..19052 /locus-tag="An14g05130"
/number=5

exon 19053..19301 /locus-tag="An14g05130"
/number=6


gene 


complement (<19354 . 


.>199 /locus-tag= 


"Anl4g05140' 



complement(join(<19354..19554,19615..19907,
19981..>19996))
complement(join(19354..19554,19615..19907,
19981..19996))



/locus-tag="An14g05140"



/locus-tag="An14g05140"



/note="unnamed protein product;

Phenotype: ECM1 mutation in S.

cerevisiae causes perturbation of

the cell surface. Remark: ECM1 of

S. cerevisiae is also called

YAL059W. Title: similarity to Ecm1

- Saccharomyces cerevisiae"

/citation=[39]

/codon-start=1

/protein-id="CAK42075.1"

/db-xref="GI:134081820"

/translation="MAKSRPQSKHSRAARRAASP 

SLDVDKSLTSLPRAEETTVQRDSI 

L S ERANAGVS KKQ S KGKAKTRAQRLRQQKGVE RA 

EAVMDQLEKKVTKSEVRAKSVKAR 

RAEWEDLNRKTKSMFEALNEEADDNMDDAMVDDA 

AASKPAKRSKPAPVTQTPWEEHE 

GIDVDDDIT" 

exon complement(19354..19554) /locus-tag="An14g05140"
/number=1

intron complement(19555..19614) /locus-tag="An14g05140"
/number=1

exon complement(19615..19907) /locus-tag="An14g05140"
/number=2

intron complement(19908..19980) /locus-tag="An14g05140"
/number=2

exon complement(19981..19996) /locus-tag="An14g05140"
/number=3

gene <20272..>22263 /locus-tag="An14g05150"
mRNA <20272..>22263 /locus-tag="An14g05150"
CDS 20272..22263 /locus-tag="An14g05150"
/EC-number="3.1.2.15"
/inference="profile:COGS:COG5533"
/inference="profile:COGS:COG5560"
/inference="profile:PFAM:PF00443"
/inference="similar to AA
sequence:PIR:S67665"
/note="unnamed protein product;
Catalytic activity: ubiquitin
C-terminal thiolester + H2O =
ubiquitin + thiol. Function: UBP1
of S. cerevisiae has an
ATP-independent isopeptidase
activity, cleaving at the carboxyl
terminus of the ubiquitin moiety
in natural or engineered linear
fusion proteins, irrespective of
their size or the presence of an
amino-terminal extension to
ubiquitin. Similarity: UBP1
belongs to peptidase family C19;



also known as family 2 of

ubiquitin carboxyl-terminal

hydrolases. Similarity: although

the predicted ORF is shorter than

UBP1 of S. cerevisiae and the

alignment contains several gaps,

it shows consistent similarity to

several ubiquitin-specific

processing proteases. Title:

similarity to ubiquitin specific

protease Ubp1 - Saccharomyces

cerevisiae"

/citation=[7]

/citation=[12]

/citation=[27]

/citation=[64]

/codon-start=1

/protein-id="CAK42076.1"

/db-xref="GI:134081821"

/db-xref="GOA:A2R3Q8"

/translation="MNARREGIDFLTPNPFNVRH 

SNYESILQQLQGNPSLVASILVTV 

ITLYLAVSYLGPASLSVARIAWNILVYLTPSRLI 

AALDSKTPKSDENPLSLTFEAKSE 

AMQRILGLDDTSFSSLFPRAPAFSSFGTSLLSSK 

NNVPPGLGNWDNSCYQNSIIQGLA 

SLQSLERFLDQNVEQLGQKALLSTHQALKDIIER 

LNSADSNGQRL WTPADLKSMS S WQ 

QQDAQEYFSKIVDQLDLEVQQATRRHTRNLGLKM 

AGPQEHVIGSGISQELQESSAGET 

RITGNQIFRNPLEGLLAQRVGCIQCGWTEGLSLI 

PFNCLTVPLGPKFEYDIRECLHHY 

MHLEPIEGVECAKCTLLRVQSQLLNLLKQIGDDE 

ETPSATPESPKISDALRSSAQERL 

QAVEQALEEEDFAEKTLSQKCHIPGKNRVSSTKS 

RQAWARPPQCLVIHVNRSMFDEN 

TGMLRKNYAAVKFPNALDLNPWCLGGVSKNQSEP 

SLEAWETNPSVSMLSHAGRGANAG 

GHYQLRAVITHYGRHENGHYICYRKYPTDTFPAH 

VPDAIIEADGDKERDERWYRLSDE 

DVQMVSEANVMSQGGAFMLFYEAVEDYSPEAAED 

VDSGLAEEEGSAPSSSCTPSETMS 

TTSAATGDSTSDVSQATSVSTALNVEPRVEKSLL 

MS DVD" 

20272..22263 /locus-tag="An14g05150"
/number=1

complement(<22652..>23365) /locus-tag="An14g05160"

complement(join(<22652..23018,23085..23287,
23363..>23365)) /locus-tag="An14g05160"

complement(join(22652..23018,23085..23287,
23363..23365)) /locus-tag="An14g05160"

/inference="profile:COGS:COG5126"
/inference="similar to AA
sequence :PIR: 13 8 424" 
/note="unnamed protein product; 
Function: centrin is a ubiquitous 
component of centrosomes and 
mitotic spindle poles of diverse 
organisms and plays a role in 



centrosome separation at the time 
of mitosis. Localization: human 
centrin is localized at the 
centrosome of interphase cells and 
redistributes to the region of the 
spindle poles during mitosis. 
Similarity: in the predicted ORF 
the EF-hands are not well 
conserved; this fact might have 
important functional consequences. 
Similarity: the human centrin 
sequence has four putative 
calcium-binding domains as defined 
by the EF-hand consensus. Title: 
similarity to centrin - Homo 
sapiens centrosome" 
/citation=[16] 
/codon-start=1
/protein-id="CAK42077.1"
/db-xref="GI:134081822"
/translation="MPPKKRSAPSSGPAPKRARQ 
SKLAKENDISASEENEIKEVFHLF 
SETVEEFADQKEGVIPRGDVRKALVALGLDPTDS 
EELHSIIEAVDPTDTGYVPYEPFL 
AVAAAKLRSRSDDAMAAEVDAAYRLFTRGSGGVI 
TLNHLRRIARELKEEELGDELLKD 
MILEANGGAGVHAGVTLEQFHDVMTRAGVF" 
exon complement(22652..23018) /locus-tag="An14g05160"
/number=1

intron complement(23019..23084) /locus-tag="An14g05160"
/number=1

exon complement(23085..23287) /locus-tag="An14g05160"
/number=2

intron complement(23288..23362) /locus-tag="An14g05160"
/number=2

exon complement(23363..23365) /locus-tag="An14g05160"



<23835..>25756
join(<23835..23848,
23890..23947,
24000..24221,
24272..>25756)
join(23835..23848,
23890..23947,
24000..24221,
24272..25756)



/number=3 

/locus-tag="An14g05170"
/locus-tag="An14g05170"



/locus-tag="An14g05170"



/EC-number="l 

/inference="profile:COGS:COG3118"
/inference="profile:PFAM:PF00085"
/note="unnamed protein product;
Function: thioredoxin participates
in various redox reactions through
the reversible oxidation of its
active center dithiol to a
disulfide, and catalyzes
dithiol-disulfide exchange
reactions. Similarity: the main



exon 23835..23848

intron 23849..23889

exon 23890..23947

intron 23948..23999

exon 24000..24221

intron 24222..24271

exon 24272..25756

gene <27298..>27621

mRNA <27298..>27621

CDS 27298..27621



feature of the predicted ORF, as

well as of the very similar N. 

crassa hypothetical protein 17E5. 

270, is to contain different 

structural domains, including the 

thioredoxin and the leucine 

zippers in the C-terminal region. 

Similarity: the similarity to 

chicken thioredoxin and similar 

proteins of other species is 

limited to a single domain of the 

predicted ORF. Title: similarity 

to thioredoxin - Gallus gallus" 

/citation=[1]

/codon-start=1

/protein-id="CAK42078.1"

/db-xref="GI:134081823"

/db-xref="GOA:A2R3R0" 

/translation="MDVELCTCSVTESWANAEQG 

LARMYSLALTGTQIDAIYHTSLVL 

NGVEYYFGQGIQTAIPGSTHHGQPMEKLHLGKTE 

LPLDVIEEYIQSLAEIYTPESYDL 

FLHNCNNFTQDLAMFALGKGIPEHIQNLPQTFLS 

TPFGQMMKPQIEMALRGVTQGTGA 

GTGTVGTQTPTTSAPTAPAQPAPVTQGSVRIASN 

LAQLEHHLAAAADSCAVIFFTSAT 

CPPCKMVYPTYDELAEEAGAKATLIKVDISTAMD 

VSMKYSVRATPTFMTFLKGQKLDE 

WSGANPAQLRGNVRLLLEMAHPPHRHQQLRLPSL 

QRPITNYVTYKKVPPLDKLVQKLD 

PHHEDPRLLSMITYLKHRTSSSTPAADTPLPQDL 

PSFATYLQTTCGFLALDHLFALVD 

LTRLLFLDPRVSGYFAEEPGHTTLLTLLSPSAGL 

SGCPYNLRIVMLQLCCTLFSTPLY 

RDQLTTSSSLLPTLLHLTTSSLLDSHTNLRWAA 

S LAYNLAALNHNARF AGHADP L S E 

ESQVELTASLVEAITQEEESQEALHGLLFALGLL 

VYEASPDSAWDLCKAMGIAETVA 

AKRNVEKVAKEPLIKEVGEELLMKGL" 

/locus-tag="An14g05170"

/number=1

/locus-tag="An14g05170"

/number=1

/locus-tag="An14g05170"
/number=2

/locus-tag="An14g05170"
/number=2

/locus-tag="An14g05170"
/number=3

/locus-tag="An14g05170"
/number=3

/locus-tag="An14g05170"
/number=4

/locus-tag="An14g05180"
/locus-tag="An14g05180"
/locus-tag="An14g05180"
/note="unnamed protein product; 
Title: weak similarity to 
hypothetical As-rell -Halocynthia 
roretzi" 
/codon-start=1
/protein-id="CAK42079.1"



gene 
mRNA 
CDS 



exon 

intron 

exon 



27298. .27621 

complement (<2 79 49 . 
78) 

complement (<2 79 49 . .: 
78) 

complement (27949 . .21 
) 



/db-xref="GI:134081824"
/translation="MSPQFPSNAPAATEKLGNQA 
AKLGHQ I RGREGATPACQFRPATA 
PTNYLSSLRLPFLRRTCTLCRHIPPVPCPIMLIH 
GPVLESPSLVMSQTPKHQCQKCVI VAGGP" 
/locus-tag="An14g05180"
/number=1
>285 /locus-tag="Anl4g05190" 

>285 /locus-tag="Anl4g05190" 

i578 /locus-tag="Anl4g05190" 

/note="unnamed protein product; 

Title: weak similarity to MLL - 

Fugu rubripes" 

/citation=[55] 

/codon-start=1

/protein-id="CAK42080.1"

/db-xref="GI:134081825"

/translation="MYPFLPWSQNQSSVRSSLSE 

ATTSTQNVSFSFSFGSSCSPTSMP 

PPPPPSPTDSLLDITPRKCSFSSGYEMNNSCAFP 

SWPNRPSLLSADSDSSTASAYLSD 

EDLLPIGSGSPCESAIDEESAAQDPTMGDVDLTT 

EQQIQMIRAAAEEEAQRARFLAQV 

QAHARAQQAMRVAQMAAAERENAKRKKRKAIPER 

KRRT AS AS KAT VCRA " 



exon complement(27949..28578) /locus-tag="An14g05190"
/number=1

gene <28642..>29713 /locus-tag="An14g05200"

mRNA join(<28642..28681, /locus-tag="An14g05200"
28733..28801,
28901..29001,
29206..29402,
29446..29476,
29582..>29713)

CDS join(28642..28681, /locus-tag="An14g05200"
28733..28801,
28901..29001,
29206..29402,
29446..29476,
29582..29713)







28642. .28681 
28682. .28732 
28733. .28801 



/note="unnamed protein product; 

Title: questionable ORF" 

/codon-start=1

/protein-id="CAK42081.1"

/db-xref="GI:134081826"

/translation=" MGVNLG WQ S DAGME VKK I D 

TVWGGQDPRVEKGGVGGECINGRR 

ARSIGGGVWTPLHLPTDKASWRTKKEPDKAARIT 

WNFPQCWPESKASTGNPACQARRS 

GGRGVP VAVRWMKQQ S SWMDTREYYHCSSSGSGR 

YVDDVGDVAAMFELISPVRCSSEL 

LKSIIGPGTKGTYLSIIMIYSHRVDSTLI" 

/locus-tag="An14g05200"

/number=1

/locus-tag="An14g05200"
/number=1

/locus-tag="An14g05200"
/number=2



intron 

exon 

intron 

exon 

intron 

exon 

intron 



exon 

intron 

exon 

gene 

mRNA 



28802. .28900 
28901. .29001 
29002. .29205 
29206. .29402 

29403. .29445 

29446. .29476 

29477. .29581 

29582. .29713 

<30773. .>31030 
join(<30773. .30823, 
30902. .>31030) 
join(30773. .30823, 
30902. .31030) 



/locus-tag= 

/number=2 

/locus-tag= 

/number=3 

/locus-tag= 

/number=3 

/locus-tag= 

/number=4 

/locus-tag= 

/number=4 

/locus-tag= 

/number=5 

/locus-tag= 

/number=5 

/locus-tag= 

/ number=6 

/ locus-tag= 

/locus-tag= 



"Anl4g05200" 
"Anl4g05200" 
"Anl4g05200" 
"Anl4g05200" 

"Anl4g05200" 

Anl4g05200" 

"Anl4g05200" 

"Anl4g05200" 

"Anl4g05210" 
"Anl4g05210" 



30773. .30823 

30824. .30901 

30902. .31030 

complement(<32776..>34762)

complement(join(<32776..32888,32979..33084,
33161..33253,
33402..33450,
33632..33776,
33997..34090,
34194..34227,
34305..>34762))

complement(join(32776..32888,32979..33084,
33161..33253,
33402..33450,
33632..33776,
33997..34090,
34194..34227,
34305..34762))



/locus-tag="An14g05210"

/note="unnamed protein product;
Title: weak similarity to protein
fragment SEQ ID NO: 24290 from
patent EP1033405-A2 - Arabidopsis
thaliana"
/codon-start=1
/protein-id="CAK42082.1"
/db-xref="GI:134081827"
/translation="MLPMDVDIHPPVCMSMQLVS
YQLGPKVTKMVSNAFRLFASRVGF
TTHPIPSLESPSTPF"
/locus-tag="An14g05210"
/number=1

/locus-tag="An14g05210"
/number=1

/locus-tag="An14g05210"
/number=2

/locus-tag="An14g05220"
/locus-tag="An14g05220"



/locus-tag="Anl4 



/note="unnamed protein product; 
Title: weak similarity to 
furin-like protein 1 Fur1 -
Drosophila melanogaster"
/citation=[14]
/codon-start=1



complement(32776..32888)

complement(32889..32978)

complement(32979..33084)

complement(33085..33160)

complement(33161..33253)

complement(33254..33401)

complement(33402..33450)

complement(33451..33631)

complement(33632..33776)

complement(33777..33996)

complement(33997..34090)

complement(34091..34193)

complement(34194..34227)

complement(34228..34304)

complement(34305..34762)

<35094..>35591
join(<35094..35340,



/protein-id="CAK42083.1"

/db-xref="GI:134081828"

/translation="MRTNNGGSGESPLFGSFDSP 

CRHSTVDWQWEGKDTSREGTVQN 

AAHGGVAPRHHHVSWEACLALKSCGWVENAGGYP 

GSWDDDGQGGSAPISWMDRLDRKC 

STQSRGQSGMLGLHTRMRQGGEQANVSKSRKSKS 

ISKSTGVPQNGNPEYFWKCAAEAE 

WNAHDQLKGTQYFLLDFHPKRTTAATGAGGSCPA 

HESYRIGKRRSPSQKGKLLLIIYS 

VPKRTIAGAVETIQSSTAVRSFIGGGQVQRKPSL 

TTEVWAHRQQQQQQQPQDEAIQDG 

QINSVLVMACDGWCPSLGRSGHGIGVLVMILNSY 

FVWDDQCQFDAFHTASCSLARLTS 

SRGTYAKKSRIGSFVQIHPNEPSSTKYTV" 

/locus-tag="An14g05220"

/number=1

/locus-tag="An14g05220"
/number=1

/locus-tag="An14g05220"
/number=2

/locus-tag="An14g05220"

/number=2

/locus-tag="An14g05220"
/number=3

/locus-tag="An14g05220"
/number=3

/locus-tag="An14g05220"
/number=4

/locus-tag="An14g05220"
/number=4

/locus-tag="An14g05220"

/number=5

/locus-tag="An14g05220"
/number=5

/locus-tag="An14g05220"
/number=6

/locus-tag="An14g05220"
/number=6

/locus-tag="An14g05220"
/number=7

/locus-tag="An14g05220"
/number=7

/locus-tag="An14g05220"
/number=8

/locus-tag="An14g05230"
/locus-tag="An14g05230"



exon 

intron 

exon 

intron 

exon 

gene 

mRNA 



35443..35474,
35526..>35591)
join(35094..35340,
35443..35474,
35526..35591)



35094..35340

35341..35442

35443..35474

35475..35525

35526..35591

complement(<35692..>37465)

complement(join(<35692..35854,36343..36411,
36584..36607,
36775..36906,
36960..36967,
37367..>37465))
complement(join(35692..35854,36343..36411,
36584..36607,
36775..36906,
36960..36967,
37367..37465))



/locus-tag="An14g05230"



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42084.1"

/db-xref="GI:134081829"

/translation="MEKAHTTPSGEDRICCTTIS 

TGNGSRSTDREIRCTPRSGCSRRM 

PGYGSVCTGFPVWQLELPSWGIHDPATGLSEQSG 

NNFPGEVCGEGERGQGRLQKSKGC 

IIDMTARSPCAL" 

/locus-tag="An14g05230"



/number=
/locus-tag="An14g05230"
/number=1

/locus-tag="An14g05230"
/number=2

/locus-tag="An14g05230"
/number=2

/locus-tag="An14g05230"
/number=3

/locus-tag="An14g05240"
/locus-tag="An14g05240"



/locus-tag="Anl4 



complement(35692..35854)

complement(35855..36342)

complement(36343..36411)

complement(36412..36583)

complement(36584..36607)



/note="unnamed protein product;
Title: questionable ORF"
/codon-start=1
/protein-id="CAK42085.1"
/db-xref="GI:134081830"
/translation="MPNHRPARRNKRFLNIYHPP 
IQPSIPIIHQDLEVKQTNGLTYLT 
TYWIAIYPSRVMWDVIIVISVMFVDDGDVRLYDK 
AIVGYMADGRKHEDYSHSHYILSA 
LGGGFATPRTPARVGRLLYEPKRSVAAWNNGAHA 
DIERYRKPDSRGQMDMREKDNYGV DNSK" 
/locus-tag="An14g05240"

/number=1

/locus-tag="An14g05240"
/number=1

/locus-tag="An14g05240"
/number=2

/locus-tag="An14g05240"
/number=2

/locus-tag="An14g05240"



/number=3

complement(36608..36774) /locus-tag="An14g05240"

/number=3

complement(36775..36906) /locus-tag="An14g05240"

/number=4

complement(36907..36959) /locus-tag="An14g05240"

/number=4

complement(36960..36967) /locus-tag="An14g05240"

/number=5

complement(36968..37366) /locus-tag="An14g05240"

/number=5

complement(37367..37465) /locus-tag="An14g05240"

<40106..>40899
join(<40106..40139,
40184..>40899)
join(40106..40139,
40184..40899)



/number=6 

/locus-tag="An14g05250"
/locus-tag="An14g05250"

/locus-tag="An14g05250"

/EC-number="l 

/inference="profile:COGS:COG1028"
/inference="profile:PFAM:PF00106"
/note="unnamed protein product;
Function: it is suggested that the
ORFL15 protein of S. spinosa is
involved in oxido-reduction during
spinosyn biosynthesis. Remark:
Spinosyns are insecticidal
macrolides which are useful for
the control of arachnids,
nematodes and insects. Similarity:
SDR is a very large family of
enzymes, most of which are known to
be NAD- or NADP-dependent
oxidoreductases with different
specificities. Similarity: the
predicted ORF shows strong
similarity to several hypothetical
and described members of the short
chain dehydrogenase (SDR) protein
family. Title: strong similarity
to protein involved in spinosyn
biosynthesis ORFL15 from patent
WO9946387-A1 - Saccharopolyspora
spinosa"
/codon-start=1
/protein-id="CAK42086.1"
/db-xref="GI:134081831"
/db-xref="GOA:A2R3R8"
/translation="MAAEQKLVLITGANQGIGFE 
TAKNLILSDNYHVILGSRDPAKGE 
EAAKTLEAVPGIKGSVSSIQIDVTDDQSVDNAAA 
Q I KAQ YGRL D I L VNNAAMS SMKHP 
PSREAMRQILDVNWGALSTTEAFLDLLRNSSEK 
RLVFVSSSTGSISRAADPSSPFHI 
ASATEYRASKAALNMMMVLYMCRLKDEGFKVFGA 



exon 

intron 

exon 



40106..40139

40140..40183

40184..40899

<41237..>43709
join(<41237..41527,
41586..41762,
41850..43150,
43289..>43709)
join(41237..41527,
41586..41762,
41850..43150,
43289..43709)



DPGLCATNLTGDPESLRRRNAAEP 
SDGGERVATWKGERDADVGKVLGVYGVSPF " 
/locus-tag="An14g05250"
/number=1

/locus-tag="An14g05250"
/number=1

/locus-tag="An14g05250"
/number=2

/locus-tag="An14g05260"
/locus-tag="An14g05260"



/locus-tag="Anl4 



/EC-number="6.1.1.16"
/inference="profile:COGS:COG0215"
/inference="profile:PFAM:PF01406"
/inference="similar to AA
sequence:PIR:S63220"
/note="unnamed protein product;
Catalytic activity: ATP +
L-cysteine + tRNA(cys) = AMP +
pyrophosphate +

L-cysteinyl-tRNA(cys). Similarity:

YNL247W of S. cerevisiae belongs

to class-I aminoacyl-tRNA

synthetase family. Title: strong

similarity to cysteine-tRNA

ligase YNL247w - Saccharomyces

cerevisiae"

/citation=[24]

/citation=[48]

/citation=[82]

/codon-start=1

/protein-id="CAK42087.1"

/db-xref="GI:134081832"

/db-xref="GOA:A2R3R9"

/translation="MESRQQPPWQQPPAHPDTRL 

PPLKVWNSLTRSKTPFVPLDPKGR 

KVKWYACGPTVYDDAHLGHARNYVSTDILRRILR 

DYFKFDVEFVMNITDVDDKIILRG 

RQQHLFNKYIAEHPTVTPDVLETARKAYSAYIKK 

NLPLVDPDTEPENFVAEAQKTVTA 

TTSAIAEAAQKLAGIDAASSKAFYDAAQDVFCFY 

LDVTEGSTIPGDAHEIFTKLTKKY 

EDHFMRDMRDLNVLDPDAVTRVTEYGQQIADFVE 

KIVANKFGYVTSDGSVYFDIKAFE 

EAGNHYARLEPWNRNNQPLLRDGEGALSRATEKK 

SSDDFALWKASRPGEPSWSSKWGQ 

GRPGWHIECSAMASSCLGSQIDIHSGGIDLAFPH 

HDNELAQSEAYWCEHKQQWVNYFL 

HMGHLSIQGSKMSKSLKNFTTVKDALERGDYTPR 

SLRIVFLLGGWRDGVEITDDLIKN 

ASSWEEKLNNFFVKAKDPSSFRSSDEAPTSSSET 

LSQALKSTQEKVHEYFCDSFDTPK 

VMAAISELVTTFNALDSQTLDLKWESMGTWVTQ 

IVTIFGLNGAASSDSCGIGWEGTD 

IPEAAKRFLYPLSAMRDTLRQAAILDDTKQASKD 

ILSLCDRLRNVDLFNLGIYLEDRE 



exon 

intron 

exon 

intron 

exon 

intron 

exon 



exon 

intron 

intron 

exon 

intron 



41528. .41585 

41586 . . 41762 

41763. .41849 

41850. .43150 

43151. .43288 

43289. .43709 

<43954. .>45254 
join(<43954. .43984, 
44194. .44823, 
44878. .45059, 
45108. .>45254) 
join(43954. .43984, 
44194. .44823, 
44878. .45059, 
45108. .45254) 



43954. .43984 

43985. .44193 

44194. .44823 

44824. .44877 

44878. .45059 

45060. .45107 

45108. .45254 

<46541. .>47683 
join(<46541. .47034, 
47092. .47344, 
47501. .>47683) 
join(46541. .47034, 



NKPALVRPVTKDMLQAREEQARKALLKQQEKEKQ 

EKLAQERLEKGKLNPVEMFRTSEY 

SAWDEDGIPTKDAAGEPLAKSKSKKLRKDWERQK 

KAHEAWLASQNGK" 

/locus-tag="An14g05260"

/number=1

/locus-tag="An14g05260"
/number=1

/locus-tag="An14g05260"
/number=2

/locus-tag="An14g05260"
/number=2

/locus-tag="An14g05260"
/number=3

/locus-tag="An14g05260"
/number=3

/locus-tag="An14g05260"
/number=4

/locus-tag="An14g05270"
/locus-tag="An14g05270"



/locus-tag="Anl4g05270" 



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42088.1"

/db-xref="GI:134081833"

/translation="MALDNARRVYIQLNNIDLHT 

ITTDLSRLHVQRLMRLTAKQASDI 

RELSICSASGGPLSLGYRVLPSCPLKSQASKDLR 

EALKKLKQCRTLRISAVEIKHTDY 

ELAWMGAIDMTHSVLSAFCETQNSLKSLTLDFRT 

DNVASLLRGGLERIPSYGPAELSR 

FPFGSSRINHISLNLNANIAGSPGLCERLLHPLY 

TRYGVNE VS L D LGKAYAEN WL DV 

LRGMGHRMLEPKLKQYCHSLRVLTMENIIFRYNY 

WLRFLHSLKREFRRLQEVNLFWEV 

NTSSESTSLITQWNHNTIYDNTQTRVTYTGPEEH 

GALTALEKIASEDPLSVHL" 

/locus-tag="An14g05270"

/number=1

/locus-tag="An14g05270"
/number=1

/locus-tag="An14g05270"
/number=2

/locus-tag="An14g05270"
/number=2

/locus-tag="An14g05270"

/number=3

/locus-tag="An14g05270"
/number=3

/locus-tag="An14g05270"



/number=4
/locus-tag="An14g05280"
/locus-tag="An14g05280"



/locus-tag="Anl4g05280" 



47092. .47344, 
47501. .47683) 



exon 46541..47034

intron 47035..47091

exon 47092..47344

intron 47345..47500

exon 47501..47683

gene <48251..>50618

mRNA join(<48251..48422,

48475..>50618)
CDS join(48251..48422,

48475..50618)



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42089.1"

/db-xref="GI:134081834"

/translation="MHQRTRSPLESLNLSLSRLS 

RDKKMHAFHFHLSQEAKRQRLQDK 

ILATHAGIEEYSAGYLKPWQLLKLKIDASEVNIS 

SKDIIKFRPCFFSPLPTDTIRTCL 

SKQPNKKSPDENSLGYAGRAMPCPSTVALMHYPL 

ILDLFREKYATSVRKGFCLDSLTT 

LRECGATSKGKWKQQPGLELAEKLSTERWQILRI 

LEDLEHEVPHAVCDIRQNIQVDRK 

KGLTVAEVKAIMRVMAVRIGLDCYRPHFLMPTLM 

DLNITLSSSSFATTAASQSERPPI 

SRRIHRQDRARELAPLTPHGGKEGHHFQRSRRI" 

/locus-tag="An14g05280"

/number=1

/locus-tag="An14g05280"
/number=1

/locus-tag="An14g05280"
/number=2

/locus-tag="An14g05280"
/number=2

/locus-tag="An14g05280"
/number=3

/locus-tag="An14g05290"
/locus-tag="An14g05290"

/locus-tag="An14g05290"

/inference="profile:PFAM:PF03169"
/inference="similar to AA
sequence:UniProtKB:SPBC29B5.2"
/note="unnamed protein product;
Function: isp4 of S. pombe is a
membrane oligopeptide transporter.
Remark: isp4 of S. pombe was
originally identified by
subtractive screening as gene
induced during the sexual
differentiation process.
Similarity: the predicted A. niger
protein shows strong similarity to
protein isp4 of S. pombe, which
belongs to the OPT subfamily of
transporters specific for small
oligopeptides (from the C.
albicans OPT1 gene). Title: strong
similarity to protein isp4p
- Schizosaccharomyces pombe plasma
membrane"
/citation=[20]
/citation=[52]
/codon-start=1
/protein-id="CAK42090.1"
/db-xref="GI:134081835"
/translation="MAAKEMNEKSFKSPQVSADI 
EIQQGGFDEIQPEFDLSEQEKEQN 
DFDYESEHSPFPEVRAWPETDDPGLPVNTFRMW 
VLGFIFTIVGSGINQFFSLRYPSV 



HIVSLVAELLAYPCGVFLAKVLPLWTISLGRLGS 
FTLNPDRHFNIKEHALIVIMSNVS 
FGYGS ADS TN 1 1 QAS S ARF YNFGL S AGF S VL WL 
CAQLLGFGVAGLAAPWLVEPARI I 
WPQVLSNCAMLETLHSRANTVANGWKISRLRFFL 
YVTAGGFVWYFFPGLMFTALSYFT 
WICWIAPRNVWNQLFGMQTGLGLSPITFDWSQV 
AYNTNPLLSPSWAAINVFAGFALF 
FWIWPGIYYSNTWFTAYLPLMTADVYDRTGTVY 
DTARVISADNTLDVDAYRQYSPPY 
LPATYAFVYGLSFASITAVLTHIGVWHGKEVWAA 
LKGKNKLDIHARLMKSYKKTPWYW 
YAAI lAI ITAI AIVMVEVYHTKLPVYGVFLGL I I 
PAIYMVPCGIIQGITNVDANQLNV 
LAEFIGGYMFEGKPLANMIFKILSTDWGQGVYF 
AMDMKLGHYLKIPPRTLFMAQGLA 
TILGALTQAGVTIWMLGHIQDICSSDQSDGFTCP 
NGRTVYSSSVIWGLVGPRRLYSVG 
RIYSSLLHFFWIGAIAPLITYFLYKYTRKQFWKY 
INWPLIFVGTYNVPPATGINYSSW 
ALVNFAFNHF I KKRFFAWWKKYNYI LAAALDTGL 
ALSGIVIFFCI SYPGAVFPDWWGN 
TVYVNTADGEGVAYKSMPEVGYFGPANGTWS" 
exon 48251..48422 /locus-tag="An14g05290"

/number=1

intron 48423..48474 /locus-tag="An14g05290"

/number=1

exon 48475..50618 /locus-tag="An14g05290"

/number=2

gene complement(<51277..>53277) /locus-tag="An14g05300"

mRNA complement(join(<51277..51279,51332..51599, /locus-tag="An14g05300"
51656..51726,
51783..51947,
52973..53002,
53059..>53277))

CDS complement(join(51277..51279,51332..51599, /locus-tag="An14g05300"
51656..51726,
51783..51947,
52973..53002,
53059..53277))

/inference="profile:COGS:COG0724"
/inference="profile:PFAM:PF00076"
/inference="similar to AA
sequence:SWISSPROT:YRA1.YEAST"
/note="unnamed protein product;
Complex: YRA1 of S. cerevisiae
mediates nuclear export of mRNA by
interacting with several other
protein factors, like Mex67.
Function: YRA1 of S. cerevisiae is
an mRNA-binding protein involved
in the nuclear export of mRNA
towards the cytoplasm. Remark:
although the gene structure looks
strange, the strong similarity to
known proteins justifies the model.
Similarity: YRA1 of S. cerevisiae
belongs to the evolutionarily
conserved REF (RNA and export



gene 
mRNA 
CDS 



complement(51277..51279)

complement(51280..51331)

complement(51332..51599)

complement(51600..51655)

complement(51656..51726)

complement(51727..51782)

complement(51783..51947)

complement(51948..52972)

complement(52973..53002)

complement(53003..53058)

complement(53059..53277)

complement(<54264..>55012)

complement(join(<54264..54902,54965..>55012))
complement(join(54264..54902,54965..55012))



factor binding proteins) family of 

hnRNP-like proteins. Title: strong 

similarity to RNA annealing 

protein Yra1 - Saccharomyces

cerevisiae nucleus"

/citation=[36]

/citation=[67]

/citation=[68]

/citation=[86]

/codon-start=1

/protein-id="CAK42091.1"

/db-xref="GI:134081836"

/translation="MSAKLDKSLDEILVSRRQGA 

RRRARRSAPNKAANAPVPVGGVKK 

TTKAAKAPGKAVQNGHPVSTESKIMVSGLPSDVN 

EANIKEYFTKSAGPVKRVMLTYNQ 

NGTSRGIASIVFSKPDTAAKAAKDLNGLLVDGRP 

MKIEVWDASHAPEVSAPKPLGER 

VAQTKPQPKPATATKAAAGAKGRKGRARRPNGRG 

NRPKPKTVEEL DAEMVDYF S TTNE 

NAGPAEGNAQANGAAPQQPAATGGEDLGMAE IS" 

/locus-tag="An14g05300"

/number=1

/locus-tag="An14g05300"
/number=1

/locus-tag="An14g05300"
/number=2

/locus-tag="An14g05300"

/number=2

/locus-tag="An14g05300"

/number=3

/locus-tag="An14g05300"

/number=3

/locus-tag="An14g05300"

/number=4

/locus-tag="An14g05300"

/number=4

/locus-tag="An14g05300"

/number=5

/locus-tag="An14g05300"

/number=5

/locus-tag="An14g05300"

/number=6

/locus-tag="An14g05310"

/locus-tag="An14g05310"
/locus-tag="An14g05310"



/inference="profile:COGS:COG5491" 
/inference="profile:PFAM:PF03357" 



/note="unnamed protein product; 
Function: the S. cerevisiae 
homolog Did3p is a class E Vps 
factor, which functions in the
maturation of a late 
endosome/prevacuolar compartment 
into multivesicular bodies that 
then fuse with the vacuole. 
Remark: DID3 of S. cerevisiae is 
also called YKL041w. Remark: S. 
cerevisiae Did3p is probably 
involved in the ubiquitin-mediated 
maturation of multivesicular 
bodies. Title: strong similarity 
to protein involved in vacuolar 
protein sorting Did3 - 
Saccharomyces cerevisiae endosome" 
/citation=[53]
/citation=[76]
/codon-start=1
/protein-id="CAK42092.1"
/db-xref="GI:134081837"
/translation="METLKAVFFGPDPQTQMRKC 
NQLIRVNTRQLDRDIAQLKTLESK 
TRQYIMNSSKRAQRNPSQAKQANMEAKTFARELV 
R I RI-;C' C T R L H T S R AQ L Q S VQMQ VN 
EAFi:.VRKIQGSLKKSTGIMKDVNTLVQMPELNAT 
MRQLSTELVRAGIIEEMVDDAIPN 
NELLEEEEEEAEEEVDKVLQEILHGKLSQVEGVQ 
PEKPLEEAPEPEDEFADQEATLEQ 
MRGRLEALKS" 

complement(54264..54902) /locus-tag="An14g05310"

complement(54903..54964)

complement(54965..55012)

<55690..>56601
join(<55690..55720,
55913..56238,
56300..56464,
56527..>56601)
join(55690..55720,
55913..56238,
56300..56464,
56527..56601)



/number=1

/locus-tag="An14g05310"
/number=1

/locus-tag="An14g05310"
/number=2

/locus-tag="An14g05320"
/locus-tag="An14g05320"



/locus-tag="An14g05320"



/inference="similar to AA
sequence:PIR:T39220"
/note="unnamed protein product;
Function: WOS2 of S. pombe is a
cochaperone protein that interacts
with cdc2 in the control of the
M-G1 transition. Similarity: WOS2
of S. pombe belongs to the p23 /
wos2 family. Title: strong
similarity to cell cycle regulator
p21 protein wos2p -
Schizosaccharomyces pombe"
/citation=[65]



/codon-start=1
/protein-id="CAK42093.1"
/db-xref="GI:134081838"
/translation="MPELPQVHPEVTWAQRSSDS 
DAERNYL YVS I KAADVARP AAT I D 
IKPTSVTFTGDSKKGVKYHVSLDLFAEIDPENSK 
VNHTDREVELVLRKKELKAEYWPR 
LLKTTQKIHFLKTDFDKWVDEDEQDEVNEDDYAN 
NFGGFEGLGGEGGLSNIDFSKLGA 
GLGGDAGAAGEEEEEEEMPELEEGEAEGAKSSKI 
QEVS" 



exon 55690..55720 /locus-tag="An14g05320"
/number=1

intron 55721..55912 /locus-tag="An14g05320"
/number=1

exon 55913..56238 /locus-tag="An14g05320"
/number=2

intron 56239..56299 /locus-tag="An14g05320"
/number=2

exon 56300..56464 /locus-tag="An14g05320"
/number=3

intron 56465..56526 /locus-tag="An14g05320"
/number=3

exon 56527..56601 /locus-tag="An14g05320"
/number=4

gene <57642..>58927 /locus-tag="An14g05330"

mRNA join(<57642..57701, /locus-tag="An14g05330"
57764..57781,
57888..58214,
58242..58438,
58498..>58927)

CDS join(57642..57701, /locus-tag="An14g05330"
57764..57781,
57888..58214,
58242..58438,
58498..58927)







/inference="profile:COGS:COG5252"
/inference="profile:PFAM:PF00642"
/inference="similar to AA
sequence:PIR:S61651"
/note="unnamed protein product;
Title: strong similarity to
hypothetical protein YOR091w -
Saccharomyces cerevisiae"
/codon-start=1
/protein-id="CAK42094.1"
/db-xref="GI:134081839"
/translation=" MPPKKQQNEPKKKKAT VE DK 
TFGMKNKKGGSAKKQIAQLQAQAA 
SNKNADAKKKEAEKARREAEKKAAEQAKKEALEL 
FKPVQVQKVPFGVDPKTVLCVFYK 
QGNCEKGRKCKFSHDASVERKAAKKDLYTDSRDK 
KDTIDDWDEEKLRKVILSKHGNPR 
TTTDKVCKYF I EAVENQKYGWFWTCPNGGDKCMY 
KHSLPPGFVLKTKEQKAAEKALMD 
KSPLNTLTLEDWLESERHKLTGNLTPVTPETFAK 
WKKERL DKKAAE E Q ARKAKE T T GR 
TLFEEGNWRADDESSDEEEDDGAWNLEALRRETE 
RIRERKEEERLAALHGTPVPVSND 
ETIAQEGAG" 

exon 57642..57701 /locus-tag="An14g05330"

/number=1



intron 

exon 

intron 

exon 

intron 

exon 

intron 



sig-peptide 
mat-peptide



57702. .57763 

57764. .57781 

57782. .57887 

57888. .58214 

58215. .58241 

58242. .58438 

58439. .58497 

58498. .58927 

<59882. .>61522 
join(<59882. .60273, 
60344. .60429, 
60510. .61189, 
61367. .61434, 
61480. .>61522) 
join(59882. .60273, 
60344. .60429, 
60510. .61189, 
61367. .61434, 
61480. .61522) 



59882. .59956 



join(59957. .60273, 
60344. .60429, 
60510. .61189, 
61367. .61434, 
61480. .61519) 

59882. .60273 



/locus-tag= 

/number=l 

/locus-tag= 

/ number=2 

/locus-tag= 

/ number=2 

/locus-tag= 

/number=3 

/locus-tag= 

/number=3 

/locus-tag= 

/number=4 

/locus-tag= 

/number=4 

/locus-tag= 

/number=5 

/locus-tag= 

/locus-tag= 



"Anl4g05330" 
"Anl4g05330" 
"Anl4g05330" 
"Anl4g05330" 

"Anl4g05330" 

Anl4g05330" 

"Anl4g05330" 

"Anl4g05330" 

"Anl4g05340" 
"Anl4g05340" 



/locus-tag="Anl4 



/inference="profile:COGS:COG4225"

/inference="similar to AA

sequence:PIR:E83710"

/note="unnamed protein product;

Title: strong similarity to

hypothetical protein BH0485 -

Bacillus halodurans"

/codon-start=1

/protein-id="CAK42095.1"

/db-xref="GI:134081840"

/translation="MTQSFVSFTAVLLTGLLSHS 

QPAAAVGISCDPNTRPYSAWMTDS 

VIARRDAILGDFAAGDDIYPNISRIGLFESTVIR 

LKDYYDADSCGQVDWHSYTLQGAE 

SLLPAVLNVSFDVNAPLTVFAAGDPMYHQYDRAP 

NETYKQALDTLHDAFNARTRDSEG 

AFWYFNPYPNWGVLEGLYPLGSFISMWKTYFEPT 

NTTLSDSLMLSLDLFKDHCHDDSS 

GLLFHGYDSSKTASWADPTTGASSYVWNRALGNW 

FMTLVDLLERSGKNSAILNETQRD 

DVYTKYWNVANTIINDADEETGCWWQVMLHGGEE 

GNYIESSGSAQFVYGLLKGARLGY 

LQGKTPNGVGYTDAADKCYNYLVSEFVKEEADGS 

LGYNGTVGDCGLYDPTFELAEVDL 

LTTNTTNTITTRSCDICITEADDKIERLHR" 

/locus-tag="Anl4g05340" 

/inference="protein

motif:SignalP:2.0"

/locus-tag="Anl4g05340" 



/product="unnamed" 
/locus-tag="Anl4g05340" 









/number=l 




intron 60274..60343 /locus-tag="An14g05340"
/number=1

exon 60344..60429 /locus-tag="An14g05340"
/number=2

intron 60430..60509 /locus-tag="An14g05340"
/number=2

exon 60510..61189 /locus-tag="An14g05340"
/number=3

intron 61190..61366 /locus-tag="An14g05340"
/number=3

exon 61367..61434 /locus-tag="An14g05340"
/number=4

intron 61435..61479 /locus-tag="An14g05340"
/number=4

exon 61480..61522 /locus-tag="An14g05340"
/number=5


gene complement(<62253..>63796) /locus-tag="An14g05350"

mRNA complement(join(<62253..62358,62418..62491, /locus-tag="An14g05350"
62545..62677,
62734..63473,
63541..63615,
63704..>63796))

CDS complement(join(62253..62358,62418..62491, /locus-tag="An14g05350"
62545..62677,
62734..63473,
63541..63615,
63704..63796))







/inference="profile:COGS:COG1073" 
/inference="similar to AA
sequence:UniProtKB:AF115902.1"
/note="unnamed protein product;
Function: ayg1 of A. fumigatus has
an unknown function, but it is
contained in the gene cluster
responsible for conidial
pigmentation. Phenotype: ayg1 null
mutants of A. fumigatus have
yellowish-green conidia. Title:
strong similarity to hypothetical
yellowish-green 1 ayg1 -
Aspergillus fumigatus"
/citation=[62]
/codon-start=1
/protein-id="CAK42096.1"
/db-xref="GI:134081841"
/translation="MAPWILGEKFNTVYPHKGSI 
KALWETKWKFACEKSVYPFHDGAI 
EDFRPIFQKLIDENINDAYTDAYTQAFFPVAEAL 
ENKASAALNNNNVEMASDLLRRAA 
WYRISRFPYVDPTREDIKKEAFNRQKKVYLKAA 
SFWKPTIQEVIIPHKHKSATDGAY
VPLFVRVPEHATAENPVPVWLMTGLDGYRPDNS 
QRTHEIINRGWATVICEIPGTADS 
PADPSDPESADRQWTTVLDYMATRPEFDMSRVAA
WGLSAGGFYAIRAAHTHRDRFVGT 
LAHGPGCHYFLDPEWLSRVDDHEYPFLLTPAWAK 
KYGYSNPEDFKKHGQKKFSLLETG 
ILDQPSCRLLLLNGVDDGVTPIEDCLMLFNHGSP 



complement(62253..62358)

complement(62359..62417)

complement(62418..62491)

complement(62492..62544)

complement(62545..62677)

complement(62678..62733)

complement(62734..63473)

complement(63474..63540)

complement(63541..63615)

complement(63616..63703)

complement(63704..63796)

<63948. .>65270 
join(<63948. .64102, 
64223. .64301, 
64493. .64828, 
64917. .>65270) 
join(63948. .64102, 
64223. .64301, 
64493. .64828, 
64917. .65270) 



KEGRFFHGLPHMGYPHSLVPSYKW 

FEDVLRSPQEPLKN" 

/locus-tag="Anl4g05350" 

/number=l 

/locus-tag="Anl4g05350" 
/number=l 

/locus-tag="Anl4g05350" 
/number=2 

/locus-tag="Anl4g05350" 
/number=2 

/locus-tag="Anl4g05350" 
/number=3 

/locus-tag="Anl4g05350" 
/number=3 

/locus-tag="Anl4g05350" 
/number=4 

/locus-tag="Anl4g05350" 
/number=4 

/locus-tag="Anl4g05350" 
/number=5 

/locus-tag="Anl4g05350" 
/number=5 

/locus-tag="Anl4g05350" 
/number=6 

/locus-tag="Anl4g05360" 
/locus-tag="Anl4g05360" 



/locus-tag="Anl4g05360" 



exon 
intron 



63948. .64102 
64103. .64222 



/ codon-start=l 

/product=" hypothetical protein" 

/protein-id="CAK42097. 1" 

/db-xref="GI : 134081842" 

/translation="MNSPPNLVTSQPPLRVLRAT 

NAGGLTNPRPLREQGASLSVSGLG 

QSVRLDTLHPCSSPTKSFYLEGKRPLIREPKLLG 

GMFTTWLSSSECLASPRFQGGRR 

IPMPAVAVWMRKDCSLLQGLGRGAWWSVSTKQI 

CFDKRGQVFRGRRLPEAYWHKKRN 

TKHGDLLYSVEKASRASVQTTTLKRVTVATLSRT 

RQVCQYLRNFLAGASCISELGRYL 

HVGEAGRLPT SP SP S LEARTGYMLAFYVYAGMI S 

PGRRLKARLAS DRL I CRLALRPCE 

KCLFGSDAEAATPTRHPSRQGFLSLLLRGLN" 

/locus-tag="Anl4g05360" 

/number=l 

/locus-tag="Anl4g05360" 









/number=l 






exon 64223..64301 /locus-tag="An14g05360"
/number=2

intron 64302..64492 /locus-tag="An14g05360"
/number=2

exon 64493..64828 /locus-tag="An14g05360"
/number=3

intron 64829..64916 /locus-tag="An14g05360"
/number=3

exon 64917..65270 /locus-tag="An14g05360"
/number=4


gene <65395..>67116 /locus-tag="An14g05370"

mRNA join(<65395..65730, /locus-tag="An14g05370"
65792..66891,
66909..>67116)

CDS join(65395..65730, /locus-tag="An14g05370"
65792..66891,
66909..67116)









/EC-number="1.16.3.1"
/inference="profile:COGS:COG2132"
/inference="profile:PFAM:PF00394"
/inference="similar to AA
sequence:PIR:A55428"
/note="unnamed protein product;
Similarity: the predicted ORF
shows very strong similarity to
abr1 of A. fumigatus, a gene of
the conidial pigmentation gene
cluster; abr1 mutants have brown
conidia, and the proposed function
of abr1 is multicopper oxidase.
Title: strong similarity to cell
surface ferroxidase precursor Fet3
- Saccharomyces cerevisiae plasma
membrane"
/citation=[28]
/citation=[37]
/citation=[40]
/citation=[50]
/citation=[62]
/codon-start=1
/protein-id="CAK42098.1"
/db-xref="GI:134081843"
/db-xref="GOA:A2R3T0" 
/translation="MIRTLITLGLYGFVQQVSSK 
DVYLDWNITWVTAAPDGYARPVIG 
INGHWPCPQIDVDVGDNLIVDVYNGLGNESTGIH 
WHGLHQNYNGYMDGVPGVTQCELH 
PNQRMRYIVPINQTGTYWYHSHEPGQYPDGLRGP 
IIVHDNVPAPYHYDEEMTLTLSDW
YHVQIPVLLDEFIVPSDHAPYGGQEPLPDAVLFN 
DMHNTKISVKPGTTYLIHWCMGN 
FPGHALVIDDHDMTIVGMDGIAVEPHFLPPQYLR 
VAVGQRVDILLTTKNDTSKNYAIW 
DGLDIDEMFVQEGRSPPQGYNPNGTAWLMYNEDA 
PLPPPPVIHVNNASLDFLDDVTLT 
PRDHLPLIDPIQHQFIFDVYPTTVDGKPAYSINN 
MTYNAPDSPTLYTAIYSDPEACMD 
PDLYGDVNPFWNYGDWEIVLNNHHDNLHPWHL 
HGHDFQVLQRTYPYGGFFDGYRNV 
SATPMRRDTLMVEPLGHFVIRFRAMNPDKLDVSK 
LTLIGQLRIWMFHCHIEWHVEIRL 



sig-peptide 



mat -peptide 



65395. .65451 



join(65452. .65730, 
65792. .66891, 
66909. .67113) 



MAIIIEAPDQLQNMTIPDDHLKICGNTQPSNWAP 
LAPSP" 

/locus-tag="Anl4g05370" 
/inference="protein
motif:SignalP:2.0"
/locus-tag="Anl4g05370" 



/product="unnamed"
/locus-tag="Anl4g05370" 
/number=l 

/locus-tag="Anl4g05370" 
/number=l 

/locus-tag="Anl4g05370" 

/number=2 

/locus-tag="An14g05370"
/number=2 

/locus-tag="Anl4g05370" 
/number=3 

complement(<67552..>68735) /locus-tag="An14g05380"



65395. .65730 



65731. .65791 



65792. .66891 



66892. .66908 



66909. .67116 



complement(join(<67552..67985,68039..>68735))
complement(join(67552..67985,68039..68735))



/locus-tag="Anl4g05380" 
/locus-tag="Anl4g05380" 



/EC-number="1.4.3.3"
/inference="profile:COGS:COG0665"
/inference="profile:PFAM:PF01266"
/note="unnamed protein product;
Catalytic activity: the D-amino
acid oxidase catalyzes the
reaction: D-Amino acid + H2O + O2
= 2-Oxo acid + NH3 + H2O2.
Catalytic activity: the D-amino
acid oxidase has a wide
specificity for D-amino acids,
acting also on glycine. Cofactor:
D-amino acid oxidase is a
flavoprotein (FAD is the
cofactor). Title: strong
similarity to D-amino acid oxidase
DAO1 - Trigonopsis variabilis"
/citation=[47]
/codon-start=1
/protein-id="CAK42099.1"
/db-xref="GI:134081844"
/db-xref="GOA:A2R3T1"
/translation="MAEKETIWIGYWSQCHPL 
NHNNSTTNHPHPFQPNRAGIIGLT
TALYIQQRLSPSQRVLIAARDFPHSTSLNYASPW 
AGAHYRPVPGSNAQHTREETQARR 
TYAHFKTLAAQEPGAGVQSTTGIEYLENPPAEYL 
NEKNIQAAYGHLDGFEYLQPGQMP 
GDVKWAVKYKTFWNSPVYCAWLLREFVLRGGEV 
KEYTFVDLREGFYLAERVRAWNC 
SGLGFGDEKSYIIRGQTCLVRNPCSATITRQNSD 
GSWSFCIPRPLGGGTIIGGTKQPH 
NWDPNPSMETRAQLLANAAKWFPFEEGSKEEFDV 
IRDIVGRRPAREGGMRIEVEKINQ 
GQEVIVHAYGAGGRGFELSWGVAEDVYDLMRQNG 
LIKAKASL" 

complement (67552 . .67985 /locus-tag="Anl4g05380" 



gene 
mRNA



gene 
mRNA 
CDS 



complement (67986. .68038 



complement (68039. .68735 
) 

complement(<69560..>70296)

complement(join(<69560..69700,69757..69922,
69974..70200,
70261..>70296))
complement(join(69560..69700,69757..69922,
69974..70200,
70261..70296))



/number=l 

/locus-tag="Anl4g05380" 



/number=l 

/locus-tag="Anl4g05380" 



/number=2 

/locus-tag="Anl4g0539 0" 



/locus-tag="Anl4g0539 0" 



/locus-tag="Anl4 



/inference="profile:COGS:COG1412"
/inference="profile:PFAM:PF04900"
/inference="similar to AA
sequence:UniProtKB:AF132969.1"
/note="unnamed protein product;
Title: strong similarity to
hypothetical protein CGI-35 - Homo
sapiens"
/codon-start=1
/protein-id="CAK42100.1"
/db-xref="GI:134081845"
/translation="MGVQKKTRKFAQMKRAIKAR
DERLKKPEAKKETPKEDQLTRQVT 
QAPSNMFFAANTALGPPYHVLVDTNFVSHAIRAK 
QDLLTSMMDLLYAKCIPTFSDCTI 
AELEKLGDKYRLALRVAKDPRWSRVKCSHKGTYA 
DDCIVDRVTKHRIYIVATNDQDLC 
RRLRKIPGVPIMKVARGKFTIEKLPDALD" 



complement (69560. .69700 

complement (69701. .69756 

complement (69757. .69922 

complement (69923 . .69973 

complement (69974. .70200 

complement (70201. .70260 

complement (70261. .70296 

<72683. .>73072 
<72683. .>73072 
72683. .73072 



/locus-tag="Anl4g05390" 
/number=l 

/locus-tag="Anl4g05390" 

/number=l 

/locus-tag="Anl4g05390" 
/number=2

/locus-tag="Anl4g05390" 
/number=2 

/locus-tag="Anl4g05390" 
/number=3 

/locus-tag="Anl4g05390" 
/number=3 

/locus-tag="An14g05390"
/number=4

/locus-tag="Anl4g05400" 
/locus-tag="Anl4g05400" 
/locus-tag="Anl4g05400" 



/note="unnamed protein product; 
Title: weak similarity to 



sig-peptide 

mat -peptide 
exon 



72683. .72742 



72743. .73069 

72683. .73072 

<73435. .>78401 
join(<73435. .73846, 
73906. .74029, 
74087. .74398, 
74453. .75231, 
75292. .>78401) 
join(73435. .73846, 
73906. .74029, 
74087. .74398, 
74453. .75231, 
75292. .78401) 



proteinase inhibitor PID -Solanum 

tuberosum" 

/citation=[17] 

/codon-start=1

/protein-id="CAK42101.1"

/db-xref="GI:134081846"

/translation="MLCVNPVFCVSAIVPHFTFC 

SPPRLCPGLPYYLSPLHNSCQQRG 

SLPSALYPWSFHCIILCLSFAFLCVCLFVRFGFS 

ALSPSSVAVPCYNLASSSPPPNPK 

PVETKGRHSDHSSYSYISTLTWFSIL" 

/locus-tag="Anl4g05400" 

/inference= "protein 

motif :SignalP: 2.0" 

/locus-tag="Anl4g05400" 

/product="unnamed" 

/locus-tag="Anl4g05400" 

/number=l 

/locus-tag="Anl4g05410" 
/locus-tag="Anl4g05410" 



/locus-tag="Anl4g05410" 



/inf erence="similar to AA 

sequence : UniProtKB : NCBIKI 1.22" 

/note="unnamed protein product; 

Similarity: A. niger EST 

EMBLEST:BE760323 corresponds to 

the 3' portion of the last exon 

and a fragment of the putative 

3'-UTR of the gene. Title: strong 

similarity to hypothetical protein 

CAD21189.1 - Neurospora crassa" 

/codon-start=l 

/protein-id="CAK42102 .1" 

/db-xref="GI : 134081847" 

/translation="MVELSWNRNPVQVPDPRPSR 

MKPAAPPVSTQPQTPPARLEPCAS 

TASLLLYAQGSVIVCLHHDTLALERRFENHQDDI 

GFIYVDNVSERGAGRLWSYDVSQ 

TAIVWDLFTGSVLARFASFEQLKVAAWMRNGNVA 

FGNEKGDVIIFEPSTSEHISCRTI 

FDPITALAPATDCRTYAIGYQNGSIMLATLHPTF 

SILHTMSTSRGPSAIISLAWHASS 

SKQKSDMLATLAANGDLRVWSVAKPPQKEAPRVI 

RVLKRSDGSPATEPKWMAWSKNGR 

I VQFLEGETWAWDVRTKHVT YEP I PT I DNPRGFA 

NYGPTATLFTLGPQYTVQQYDIDS 

PAMVANVQHAPSGALPAPMEQPRLRTLQEPPTMR 

ESGAMFGSRRTPFDTNGIEAVRQQ 

RADVASPVNRSHANSVTNSVSSKASSGMHKIPFS 

PPSRSGQTTTSFSLTSASGRETPQ 

PSGASYAYASSVSMSSTKSSRAGSRLRNEVQLSP 

AEKPVDLFPFTRARLNDVPYGHHQ 

PLDESRLTPDDLRQQMLSWFGFDGDIEELIRDE 

LSNHPEASQSAVLLAQWIGESDTD 



exon 

intron 

exon 

intron 

exon 

intron 

intron 

exon 

gene 

mRNA 

CDS 



73435. .73846 
73847. .73905 
73906. .74029 
74030. .74086 
74087. .74398 
74399 . . 74452 
74453. .75231 
75232. .75291 



75292. 



401 



complement (<78998 . 

20) 

complement (<78998 . 
20) 

complement (78998. . 
) 



NMLSMISSGPASITDWMLLAFSQMSGESQSNKIG 

QALVQKLLEIGDVHTAATILLGLG 

DRNDAIEVYVSQNYYMEAILMTCLVMPTDWQRQS 

YLVRRWGEHWSHSQQQLAIRCFM 

CTGVEPSEPWTSPAAQQATSFAEAITRRSPMTSP 

EPYPNPASLMVPNSHPSPAVNGRP 

VSKTPALKLITSFDSQPNNRFKFPGLKSDDRTPT 

NAPGVTPIAESAWESALSPGGFG 

SYKLNNIQSINNAMNTRTTTPAFNRRRLPSIGET 

PIDVHPPTFPSTGYNKQALDYGST 

SENDESQPLEQKGGEADDEQLGLLSSARYDPEEY 

KPSPQTAVQATADTFSTIKGLPSP 

AAGVFEALKERSDSRNGTRDRKPDGLQLNLHHRN 

ASENDMHAQSEMLANFRSPASTLN 

SF S S ARSP S VS GRS I DP Y I S S LGEANYHAKKYQG 

NHHNGRGRRNTDDSASQTSARRYR 

SRNASQETRGRNEKRYIPPAKRSPSSPVPMSPEE 

LARINVNAANKPSSRTRSASRVRK 

PRSRNSSERRQNRSASRNTTNRLGTENQDPSSRG 

RSMDRQESGTRSPSSPLPGSSAED 

ALRLVASDRERLRTRQRSGSRRPQKGAGGRGETP 

SESKWARARSRSRHDTEPTESAQQ 

GVLEFVNGS TAHDD I Y I VDNSP S ANNMLAQAS HA 

EHTAVLQDKLVEQRRKELAAAELE 

ARRLSLARNPSAPNIPFPGELQPSRSALENATFS 

INAFAPRTPGRNRTATNKGSPEYP 

SSSDSNSSRSGVPLGLPATPKAMKHPKYNGYEET 

APTPPVP S DAG ILLS DAKYQGDAE 

RIGRSMSVPVPESHQFGAVPTDLPMHPRYNPHLP 

RSRSTSRTRSMAPRRDSGGHSYGG 

SPVTVS lEET I ENAMANQQTENPP I 

LPELQHLSPPPPPPPPTAPLASPRESSGTIDIA 

IDNEDFGRLLPRAMTTGPTLHMDGRPGLEQRRMS 

FEHRRGRSVNESLSSKIRNLARMG 

STKSTDSWVATDISIPYESVIVDGRI" 

/locus-tag="Anl4g05410" 

/number=l 

/locus-tag="Anl4g05410" 
/number=l 

/locus-tag="Anl4g05410" 
/number=2 

/locus-tag="Anl4g05410" 

/ number=2 

/locus-tag="Anl4g05410" 
/ number=3 

/locus-tag="Anl4g05410" 
/number=3 

/locus-tag="Anl4g05410" 
/number=4 

/locus-tag="Anl4g05410" 
/number=4 

/locus-tag="Anl4g05410" 
/number=5 
>803 /locus-tag="Anl4g05420" 

>803 /locus-tag="Anl4g05420" 

i0320 /locus-tag="Anl4g05420" 

/note="unnamed protein product; 
Title: weak similarity to 
hypothetical Ena-VASP-like protein 



gene 
mRNA 



complement (78998 
) 

complement (<806: 
92) 

complement ( join 
.80699, 80769. .8 
81145. .81440, 
81545. .>81592) ) 
complement (join 
80699,80769. .811 
81145. .81440, 
81545. .81592) ) 



- Homo sapiens" 
/codon-start=l 
/protein-id="CAK42103 .1" 
/db-xref="GI : 134081848" 
/translation="MQLTKPAQHNHSASSIPSSS 
KLPPSITTTTTPTKTPHHRSPPST 
MSFTSDPPSTGDTSPVLKDPNDPFSRFYPLKVKP 
ELFKRPDPITMYLGFGGQSVWNAV 
ATMNVIRTLDQHQRIANRPLTQAEADATIEHSTA 
RLYHSRIGQPVGVLAGAIVTYKAA 
QNTEAWKKNFSSEAGLRRVDLLLEGIQKWARANP 
DTWRKEAIRAGLKTALFAFVGLML 
GDGIAGANDINHMMSDSRLKGLWAEARKRRDELR 
GHVTTGKMPPPVPTTPAPRPTGQE 
VPKSEEQNLPQQLGDGYGMQSGSDEEAARFGSST 
TSWGSETKKVATPADEAPSPSKDG 
NGDFWDDASPVAAEYRGKDQSMSGESAWERIRRQ 
NNVS AQRE TRSTVSRPAPSSTASA 
PAPAYDKPSERELAQAEFDRMLEAERKMSTDSLN 
TPQQNRSGWWAKWD" 
.80320 /locus-tag="Anl4g05420" 

/number=l 
10. .>815 /locus-tag="Anl4g05430" 

(<80610. /locus-tag="Anl4g05430" 
1072, 



(80610.. /locus-tag="Anl4 
072, 



/EC-number="1.1.1.-"



/inference="profile:COGS:COG1028"
/inference="profile:PFAM:PF00106"
/note="unnamed protein product;
Similarity: SDR is a very large
family of enzymes, most of which
are known to be NAD- or
NADP-dependent oxidoreductases
with different specificities.
Similarity: the predicted ORF
shows strong similarity to several
hypothetical and described members
of the short chain dehydrogenase
(SDR) protein family. Title:
similarity to meso-2,3-butanediol
dehydrogenase (D-acetoin forming)
budC - Klebsiella pneumoniae"
/citation=[59]
/citation=[80]
/citation=[81]
/citation=[85]
/codon-start=1
/protein-id="CAK42104.1"
/db-xref="GI:134081849"
/db-xref="GOA:A2R3T6"
/translation="MASKPFAIIAGVGPGTGASI 
ARREARSYPIWLARNPANYQPW
DEINSSGGQATGISTDLSDSTGVRSAFEQIKSQF 
QGSSLAAAVFNSGGGFVRKPFLEL 



gene 
mRNA 
CDS 



complement ( j oin ( 8 1 4 
81440, 81545. .81592) 



complement ( join (80 
80699, 80769. .81072 
81145. .81434) ) 

complement (80610. 
complement (80700. 
complement (80769 . 
complement (81073. 
complement ( 81145 . 
complement ( 81441 . 
complement (81545 . 



TETEFSSGFEGQGKGAFNFAQNVLPLFPAESQGG 

LKYPPTLIFTGATASVKGSAQFAS 

FAAGKFALRALAQSLAREFGPKGIHVSHVIIDGV 

IDIPRTKGWVFEKEDAKLDPEAIA 

DSYWHLHTQPRTTFGFELDLRPYVERW" 

/locus-tag="An14g05430"

/inference="protein
motif:SignalP:2.0"
/locus-tag="Anl4g05430" 



<82379. .>83449 
<82379. .>83449 
82379. .83449 



gene 
mRNA 
CDS 



82379. .83449 

<83874. .>84647 
<83874. .>84647 
83874. .84647 



/product="unnamed" 
.80699 /locus-tag="Anl4g05430" 

/number=l 
.80768 /locus-tag="Anl4g05430" 

/number=l 
.81072 /locus-tag="Anl4g05430" 

/number=2 
.81144 /locus-tag="Anl4g05430" 

/number=2 
.81440 /locus-tag="Anl4g05430" 

/number=3 
.81544 /locus-tag="Anl4g05430" 

/number=3 
.81592 /locus-tag="Anl4g05430" 

/number=4 

/locus-tag="Anl4g05440" 

/locus-tag="Anl4g05440" 

/locus-tag="Anl4g05440" 

/note="unnamed protein product; 

Title: strong similarity to 

hypothetical protein encoded by 

An11g07600 - Aspergillus niger"

/codon-start=l 

/protein-id="CAK42105. 1" 

/db-xref="GI : 134081850" 

/translation="MDTDKAAETTRLHEENEDPA 

VKDVIRVASYHRLDYDLAVIRTFP 

RPHHLIRDLLTPKTLIIDLNSRLGVLDVFPLEFI 

HEICLMLDVQSLFRFHRVNRRARQ 

IASASGYYKAWKHAIGALFVILRTGMASWYKLS

DLYDVLCTKNCAICNDFGGFIFLP 

SFMRCCFFCIEDDELPPVLPASQVRPYLNRRSAP 

PLSSRIPMIKTLPGIYSLDESKRT 

RRFELVLEQSAISLFSSDRVPTFPKKYSRDTELQ 

RYMVSTALPYLDNKNDVIEKGVSC

SGCQISLEKALQSSGAEYDVWDDRDKIYSHDQFM 

EHFETCPEARKLWELSEGGTIEPD 

IPAFVKNGGCAKKRELKNVFYH" 

/locus-tag="Anl4g05440" 

/number=l 

/locus-tag="Anl4g05450" 
/locus-tag="Anl4g05450" 
/locus-tag="Anl4g05450" 



83874. .84647 

<85682. .>87100 
join(<85682. .85731, 
85821. .86058, 
86116. .86309, 

86392. .>87100) 
join(85682..85731,
85821. .86058, 
86116. .86309, 
86392. .87100) 



exon 

intron 

intron 
exon 



85682. .85731 
85732. .85820 
85821..86058
86059. .86115 
86116. .86309 



/inference="profile:COGS:COG0515" 

/note="unnamed protein product; 

Title: strong similarity to 

hypothetical protein encoded by 

An08g11330 - Aspergillus niger"

/codon-start=l 

/protein-id="CAK42106 .1" 

/db-xref="GI : 134081851" 

/translation="MDDSDYRILIQNRVRYLTIR 

PGTFDRATLSMPLSSLPDLPKDCT 

WNSAHLSRDKSSGELWELQKRDLVGIADVWHPS 

QVDCLKLRRTRQLTATTFEATYCG 

SEPISSPPLNEDTTVIAKVARFEWEIPRLSQETS 

IYKQLQDTGLAPRFLGHVHEHGRV

IGFILERLKGREAGIKDLPLCQSVLKRLHDIGIL 

HGDVNRYNFIVRGDAVRLIDFERS

QQSKGATTLMHAEMESLCEQLLEDTGRGAGFTRS 

NQQIE" 

/locus-tag="Anl4g05450" 
/number=l 

/locus-tag="Anl4g05460" 
/locus-tag="Anl4g05460" 



/locus-tag="Anl4g05460" 



/inference="profile:PFAM:PF04438"

/inference="similar to AA

sequence:PIR:S46736"

/note="unnamed protein product; 

Title: similarity to hypothetical 

protein YHR040w - Saccharomyces

cerevisiae" 

/codon-start=l 

/protein-id="CAK42107.1" 

/db-xref="GI : 134081852" 

/translation="MSTSTPDSPLSDLCSICHLQ 

PPKYRCPRCSTRTCSLPCTRRHKL 

WSQCSGVRDPAAYLRRSELATESAFDRDFNFITG 

IERSIERAGRDAENRGIDGVAAMG

KKGGAGAGGKRKRGDGGAGGGTRAEMEFLRRAEE 

RGVKWRAPKGMSRNKGNGSRWLG 

RNQCLVWTVEWILSDGEKRMRNCSETAPVADSYD

RWPVPKEEGSDEQGQKQTGDEKP 

EGTSELKTEEQQPQDAPSTTAPEPASTTEPTQPT 

DAPTAQPEASNATPITSDSLPLTP 

HRDLYFYLHRPRTATKQPVLAPLPPKISLAAALR 

DHIVLEFPTIYVLPDSPDTIRAQE 

DAKYILEEEYLRTHQSPEESSDDTGDADDAQLPP 

GAIDLRGVDESKVLEVLQKDLFEPSAAA"

/locus-tag="Anl4g05460" 

/number=1

/locus-tag="An14g05460"
/number=1

/locus-tag="Anl4g05460" 
/number=2 

/locus-tag="Anl4g05460" 
/number=2 

/locus-tag="Anl4g05460" 



gene 
mRNA 
CDS 



sig-peptide 



86310. .86391 



86392. .87100 



complement (<87776
39) 

complement(join(<87776..88536,88589..>89339))
complement(join(87776..88536,88589..89339))



/number=3 

/locus-tag="Anl4g05460" 
/number=3 

/locus-tag="Anl4g05460" 
/number=4 

893 /locus-tag="Anl4g05470" 



/locus-tag="Anl4g05470" 
/locus-tag="Anl4g05470" 



/note="unnamed protein product; 
Function: enterophilins are a new 
family of proteins with unknown 
function associated with 
enterocyte differentiation in C. 
porcellus. Title: weak similarity 
to hypothetical enterophilin-2L - 
Cavia porcellus" 
/citation=[84] 
/codon-start=l 
/protein-id="CAK42108.1" 
/db-xref="GI : 134081853" 
/translation="MTTTLERSLSRTSSMSIPVS 
SPRLSLIHENTPPALADPSLSHIH 
DRLTFLDSRLLELRSSVLTKDGYVDRRNREDEHI 
RREFEAHRSISSRIDLNWALKTD
VDQLKSGISQLKSSIGQSGNETVFLRSDVDRLSK 
NVEQIQSDVEHLQTDVCGCRVEIS 
KLHATISQLRTDLITLQHETSRHLTSVFNRFSLI 
EARMKHSERVRFNSLAHTTHAPIT 
PVPWEEDGSLQWPEYFPRTVWRFWCLKKRSRIN 
RLAQLAEFYQLGGYEYWGRMHQEQ 
TLFASDSDSSDSSDYPSNLSRAEAVRLYPEAAHQ 
ALAATLGLVYYKIRNEVGEGPNAS 
ITRPPKRQQEEAISATSSKEKPVKMARRPTVSPT 
TLKRLITGPSVESKSLTSEESDKL 
GWNAYSDVSDDTMSKLRGIVSDEVGTLLRALEKG
RIRLRSNRSERTNLSPTESRAAES 
KAGSRNSKAAPEDEPTLPNTIPTEVLSISSDHAE 
KIGDHELPDTASDASSPLT" 
complement (87776. .88536 /locus-tag="Anl4g05470" 
) 



complement (88537. .88588 



/number=l 

/locus-tag="Anl4g05470" 



/number=l 

complement (88589. .89339 /locus-tag="Anl4g05470" 



<91050. .>91256 
<91050. .>91256 
91050. .91256 



91050. .91127 



/number=2 

/locus-tag="Anl4g05480" 

/locus-tag="An14g05480"
/locus-tag="An14g05480"
/codon-start=1

/product="hypothetical protein"
/protein-id="CAK42109.1"
/db-xref="GI:134081854"
/translation="MPASDKKKRALKIAFLTLLW
VHFAGKLDEISGEADDSDADEDET
IALTDASDEMMIKAMEAATRLLDE"
/locus-tag="An14g05480"
/inference="protein



mat -peptide 

exon 

gene 
mRNA 
CDS 



91128. .91253 

91050. .91256 

<92137. .>93393 
<92137. .>93393 
92137. .93393 



sig-peptide 

mat-peptide 

exon 

gene 

mRNA 

CDS 



92137. .92205 

92206. .93390 

92137. .93393 

complement (<94126 . . >979 
82) 

complement ( join (<94126 . 
.94820, 94895. .>97982) ) 
complement (join (94126 . . 
94820,94895. .97982) ) 



motif:SignalP:2.0"
/locus-tag="Anl4g05480" 
/product="hypothetical protein" 
/locus-tag="Anl4g05480" 
/number=l 

/locus-tag="Anl4g05 49 0" 

/locus-tag="Anl4g05 49 0" 

/locus-tag="Anl4g05490" 

/inference="profile:COGS:COG2273" 

/inference="similar to AA

sequence:PIR:S19012"

/note="unnamed protein product; 

Title: weak similarity to 

precursor of licheninase - Bacillus

polymyxa" 

/citation=[8] 

/codon-start=l 

/protein-id="CAK42110 . 1" 

/db-xref="GI : 134081855" 

/translation="MKLLLTTLILPSLLSTHLLF 

ATAISIPSPQETHPDANCDCYITS 

SFTSNSNLTAPTYFTHYKFWDFRNAPLPQSGPET 

TQTNPSNPHDTSTLEIYPLTKTLF 

SHDWRAQTWDREATDISPVPMKNTDGNVFLLKHP 

SSTPRDPRSSFLVLRTTRFQDHAS 

TAEMEGLFGHIYHCSLRLRMRLMSRDDIVTTAAS 

THPNNQPGSQPGSQSKIPQGSCAG 

IFTYRSSICESDIEILTTEDSHTVHYANQPDYDP 

ISDTLIPGASSIVNLTKPWTSWTT 

HRLDWVADMSAWYADGQLQSKSNYSVPDRPSIVA 

MNLWSNGGNWSGDMKVGESVFMGI 

EWIELVYNVTKADQDLPIKDHPGHRDRYHRVGGL 

GSASASFSSSASSVGGDLVDEEDV 

ERQMGKAKEKSDGCQRPCWIDPWHPV"

/locus-tag="An14g05490"

/inference="protein

motif:SignalP:2.0"

/locus-tag="An14g05490"

/product="unnamed"

/locus-tag="Anl4g05490" 

/number=l 

/locus-tag="Anl4g05500" 
/locus-tag="Anl4g05500" 



/locus-tag="An14g05500"

/inference="profile:COGS:COG5160"
/inference="profile:PFAM:PF02902"
/inference="similar to AA
sequence:UniProtKB:AF260129.1"

/note="unnamed protein product;
Similarity: the BLASTP alignment
is very nice, but limited to the
C-terminal part of the predicted
ORF. Title: similarity to
axin-associating molecule Axam -
Rattus norvegicus"
/citation=[79]
/codon-start=1
/protein-id="CAK42111.1"
/db-xref="GI:134081856"



/translation="MDRMFTVWWEYPQKVNSKCI 

VHPPAGTHHVTCWMRKSGRQAPSE 

VELICCCSITCFVSSSTSRGFIASPPSLRLPRPS 

PTPRSPRAFLWSSNSLRRRARLPP 

SLWQSPMAALSEYICFGSSFPFAHPGAALVFFFT 

LLFLLFFFFFFFFPFSPITHLPIT 

FRRVYELSEYSNFPFLAIELVPLLSPLLSPKELV 

SRLLLTMSGVFSTMRPSVEDHPMP

DVCYMPSDSAPHEEAVDNTAHDPMDCAPDSPASN 

IPQETAKAPSAEIPKYQLRSVHVP 

GNPKAVRLNGKHEVKPSFPLRTPLIRPDHRFSPN 

SRAASFLGVDALQKSGRNAHSSHL 

LSHLGTSKYLSAGLNHVSWSNNKPLSLPFGPDRS 

NETGYLLGGKSIGIDKKTFHMNVN 

KHSGFTTPSRKRSMEGDGYEQAQPITSPSAKFRR 

VRAGSFARSAATELEATANEDDWY 

PGSPMDIDTPEHNNEPNMVTALAAVESHLALPET 

SPKSRRLDASKVKPVTALAVSTTP 

STPENHMMPGAWPESVPSHWSTDAATDIEDSPMI 

SDEEILSNNTDMADAGEPTIPVNS 

DAEPQPSSPWYAYWGTWHSYLQTWSIPTGITQN 

FSYAARSAANAVGAARRRVCALWN 

RRGGRRLPSPPGSPTRLNLRNLSPEQQERVDRDR 

RIRSGQSSPRTDKSFPDFSNVPNF 

SHNNNTAPTVESSQILQDLVSQEQMPQALSDSVS 

VQRRAHEPKNVGTRKHPGMPPVKQ 

PARRSPPRSAQRSAQRSTQDTAQKPLRRQGSPGI 

RKQKHITRAERAENRERSAAFRAL 

FVGDIAGMKKAMEAQAARDRQASAAGRRPVELNL 

MRSSYELDRAQDLNMLANVQSRSF 

TAVDQLGATEKVLSSSITVAPKTILKIRGRPSIS

GNRTRSRRATPRRVRFREPWAEF 

IDDTRDWGEVEDNLAPHLGDTLDVNERTTETSVG 

TKAEYTEKENVPPALENEEENEPI

VDPWSQPTFEFPLGRAVSAVSLFYPEPKPIPEGR 

TESVYADRWRQIQEEERQKALPSH 

GRVSGGKTVATTLSGDPLTKRSLATCYTRGEWLN 

DEVINGYLALIVDYLRRKNHNAGR 

NDKPRFHAFNSFFFSNLRDKGYESVARWAKRAKI 

GGSLLLDVDTVYIPVHNSQHWTLV

WRPGERSIEHFDSLGARSRRHIAWQTWLRGEL 

GPKYVEEEWRVLPSLSPQQDNGSD 

CGVFLLTTAKAVAIGLEPLSYGAQDTPLLRRKIV 

AELMAGGLEGEFDPAQQDHVLL"

complement(94126..94820) /locus-tag="An14g05500"

/number=1

complement(94821..94894) /locus-tag="An14g05500"

/number=1

complement(94895..97982) /locus-tag="An14g05500"

/number=2

complement(<98558..>100377) /locus-tag="An14g05510"

complement(join(<98558..98608,98659..99094, /locus-tag="An14g05510"
99183..99953,
100034..>100377))

complement(join(98558..98608,98659..99094, /locus-tag="An14g05510"
99183..99953,



exon complement(98558..98608)

intron complement(98609..98658)

exon complement(98659..99094)

intron complement(99095..99182)

exon complement(99183..99953)



/inference="profile:COGS:COG0671" 
/inference="profile:PFAM:PF01569"
/inference="similar to AA
sequence:PIR:S38127"
/note="unnamed protein product;
Function: YSR3 of S. cerevisiae is
involved in regulating ceramide
and phosphorylated sphingoid base
levels, and modulates stress
responses through sphingolipid
metabolites. Remark: YSR3 of S.
cerevisiae is also called YKR053c.
Similarity: A. niger EST
EMBLEST:BE760128 comprises part of
the putative 5'-UTR of the gene
and part of exon 1. Title: strong
similarity to

dihydrosphingosine-1-phosphate

phosphatase Ysr3 - Saccharomyces

cerevisiae"

/citation=[35]

/citation=[38]

/citation=[45]

/citation=[49]

/codon-start=1

/protein-id="CAK42112.1"

/db-xref="GI:134081857"

/translation="MVNGKNGQPDAGLRSLDHCM 

WCRCGIAIVFRGLTATARQLVSDR 

NQLPYWRYWPRQKLLPLIRYETPYLAWVQERVRT 

PALDSYFAFTANLGTHTFFMVFLP 

ILFWCGYTSLGRGMVHLLASGVFFSGFIKDLLCL 

PRPLSPPLQRITMSGSAALEYGFP 

STHSTNAVSVAAYALALLNSSESTLSPQVNIALQ 

AITYLYVCSIVLGRLYCGMHGFFD 

WIGCLLGFLLAFLQYTLGPTIDEYVLSATGRGA 

TLVILLILGLVRIHPEPADDCPCF 

DDSVAFAGVMLGAQVAYWHIARLSLVWDEPVPAT

VPFEFRQVGLVKTSLRLVIGVLML 

FTWRAVTKPLLLRALPPVFRGLEKLGLILPRRFF

RNASQYTKVPSQLRDHEVLPGFSE 

IPSIITTIRHPRRRAISIGPQSEADAYETLAYRE

KRRRESLSTANQASPVPEEGGPVS 

ESEPRKLSRKPSKLHEYENMMGTGAPAVASSVDV 

SSQEASSTVPFPPFESEIEADEKE 

VFSQIKQPRVNPCASSYKVADADCV" 

/locus-tag="Anl4g05510" 

/number=l 

/locus-tag="Anl4g05510" 
/number=l 

/locus-tag="Anl4g05510" 
/number=2 

/locus-tag="Anl4g05510" 
/number=2 

/locus-tag="Anl4g05510" 



/number=3 



gene 
mRNA 



complement(99954..100033)

complement (100034 
77) 

complement(<101552..>102291)

complement(join(<101552..101717,
101818..101872,
102049..102103,
102196..>102291))
complement(join(101552..101717,101818..101872,
102049..102103,
102196..102291))



/locus-tag="Anl4g05510" 
/number=3 

/locus-tag="Anl4g05510" 
/number=4 

/locus-tag="Anl4g05520" 
/locus-tag="Anl4g05520" 



/locus-tag="Anl4g05520" 



complement(101552..101717)

complement(101718..101817)

complement(101818..101872)

complement(101873..102048)

complement(102049..102103)

complement(102104..102195)

complement(102196..102291)

<103137. .>103945 
join(<103137. .103259, 
103323. .103526, 

103586. .>103945) 
join(103137. .103259, 
103323. .103526, 
103586. .103945) 



/note="unnamed protein product; 

Title: questionable ORF" 

/ codon-start=l 

/protein-id="CAK42113 .1" 

/db-xref="GI : 134081858" 

/translation="MAGPPSRATTTLSPSSLESF 

PILESHSGFSGRVPETAFDPAIQS 

SNDRRPDMLARLFHVSGSMERIPRSLRHRCLQIP 

IPAGLIPRVAVIVSFPLAHKETNT 

EPHGKTFRRPNGLAQDHAKLC " 

/locus-tag="Anl4g05520" 

/number=l 

/locus-tag="Anl4g05520" 
/number=l 

/locus-tag="Anl4g05520" 
/number=2 

/locus-tag="Anl4g05520" 
/number=2 

/locus-tag="Anl4g05520" 

/number=3 

/locus-tag="Anl4g05520" 
/number=3 

/locus-tag="Anl4g05520" 
/number=4 

/locus-tag="Anl4g05530" 
/locus-tag="Anl4g05530" 



/locus-tag="Anl4g05530" 



/inference="profile:COGS:COG0481"
/inference="profile:COGS:COG0532"
/inference="profile:PFAM:PF00071"
/note="unnamed protein product;
Remark: RHO1 gene encodes a
low-molecular-weight GTPase. One
of its recently identified



functions is the regulation of

beta-1,3-glucan synthase, which

synthesizes the main component of

the fungal cell wall. Similarity:

belongs to the ras transforming

proteins. Title: strong similarity

to GTP-binding protein rho1 from

patent WO9738129-A1 - Candida sp"

/citation=[46] 

/codon-start=l 

/protein-id="CAK42114.1" 

/db-xref="GI : 134081859" 

/translation="MSEATNITHPPAYSKKFVW 

GDGGCGKTCLLISYSQGYFPEKYV

PTVFENYITQTTHRASGKTVELALWDTAGQEEYD 

RLRPLSYPETDLLFVCFAIDCPAS 

LENVMDKWYPEVLHFCPTTPLILVGLKSDLRSKR 

TCIELLKTQGLTPVTPEQGEAVAR 

RMNASYIECSSKEMRGVDGVFSLAVDTWSAEEQ

DWNDRQTSTTPGSKPKVGGGGGGK 

KVKKRSCKIL" 

exon 103137..103259 /locus-tag="An14g05530"

/number=1

intron 103260..103322 /locus-tag="An14g05530"

/number=1

exon 103323..103526 /locus-tag="An14g05530"

/number=2

intron 103527..103585 /locus-tag="An14g05530"

/number=2

exon 103586..103945 /locus-tag="An14g05530"

/number=3

gene complement(<104005..>105283) /locus-tag="An14g05540"

mRNA complement(join(<104005..104166, /locus-tag="An14g05540"
104250..104274,
104718..104893,
105194..>105283))
CDS complement(join(104005..104166,104250..104274, /locus-tag="An14g05540"
104718..104893,
105194..105283))

/note="unnamed protein product; 

Title: similarity to essential 

protein #328 from patent 

WO200286090-A2 - Aspergillus 

fumigatus"

/codon-start=l 

/protein-id="CAK42115.1" 

/db-xref="GI : 134081860" 

/translation="MSTTPIPRNRSSSPTVFEQQ 

RAELVREIAWGNEFGSVEALWSQ 

FENFMGRPGEENEAGDGGKQKAGASGSGGGHHQD 

EEQSQLQEEDRETGIRVEQETQVH 

RGSVFHLHGLLVMSCHVTYKKRGKPSKQPGFSFG 

TTGRAEKQTSQEEE" 

exon complement(104005..104166) /locus-tag="An14g05540"

/number=1

intron complement(104167..104249) /locus-tag="An14g05540"

/number=1



complement (104250. .1042 
74) 

complement (104275. 
17) 

complement (104718. 
93) 

complement ( 104894 . 
93) 

complement (105194. .1052 
83) 

<106070 . .>107287 
join(<106070. .106170, 
106227. .106273, 
106356. .106857, 
106906. .>107287) 
join(106070. .106170, 
106227. .106273, 
106356. .106857, 
106906. .107287) 



/locus-tag="An14g05540"
/number=2

/locus-tag="Anl4g05540" 
/number=2 

/locus-tag="Anl4g05540" 
/number=3 

/locus-tag="Anl4g05540" 
/number=3 

/locus-tag="Anl4g05540" 
/number=4 

/locus-tag="Anl4g05550" 
/locus-tag="Anl4g05550" 



/locus-tag="Anl4g05550" 



exon 
intron 



106070. .106170 
106171. .106226 



/EC-number="1.-.-.-"

/inference="profile:COGS:COG0604"
/inference="profile:COGS:COG1064"
/inference="profile:PFAM:PF00107"
/note="unnamed protein product;
Remark: this ORF is a putative
alginate lyase by homology.
Similarity: belongs to the
NADPH:quinone reductase and
related Zn-dependent
oxidoreductases. Similarity: the
ORF shows similarity to several
oxidoreductases with different
specificities. Title: strong
similarity to protein from patent
WO0100844-A - Corynebacterium
glutamicum"
/codon-start=1
/protein-id="CAK42116.1"
/db-xref="GI:134081861"
/db-xref="GOA:A2R3U8" 
/translation="MKAIGVENYGPVDRLEAWEV 
PKPTDLQPRDLLVRIKGVAVNPVD 
TKVRNDYYDRVPRPFQITGYDASGIVEATGSQCS 
LFNVGDEVYYSGSPIRQGSNAEYQ 
LVDERSVGRKPKSLDFVESAAMPLTYITAYEALV 
ERLEIEKEEKAALLIINGAGGVGA 
MATQIARWLDLPWITTASRPETIQFTKDMGAT 
HWNHREDVASQIEKLNLEVPLKY 
IFITHSTDQYMDTCARVCAPFGKVCSIVQGQAKM 
YGTEFMSKSLTFVWCLLGTKPYHK 
VDVESHHRILEELTDMIDAGKIKCHLTKRLKLTL 
DGVKKGHELLQSGGSIGKVGLGVD 
EPGPGEIFT" 

/locus-tag="Anl4g05550" 
/number=l 

/locus-tag="Anl4g05550" 
/number=l 



intron 

exon 

intron 

gene 
mRNA 



mat-peptide



106227. .106273 
106274. .106355 
106356. .106857 
106858. .106905 

106906..107287

complement(<107589..>108580)

complement(join(<107589..107677,
107764..107819,
107918..107986,
108097..108183,
108460..108535,
108574..>108580))
complement(join(107589..107677,
107764..107819,
107918..107986,
108097..108183,
108460..108535,
108574..108580))



/locus-tag="Anl4g05550" 
/number=2 

/locus-tag="Anl4g05550" 
/number=2 

/locus-tag="Anl4g05550" 
/number=3 

/locus-tag="Anl4g05550" 
/number=3 

/locus-tag="An14g05550"
/number=4

/locus-tag="An14g05560"
/locus-tag="An14g05560"



/locus-tag="Anl4 



complement(join(108468..108535,
108574..108580))



/codon-start=1

/product="hypothetical protein"
/protein-id="CAK42117.1"
/db-xref="GI:134081862"
/translation="MYMRNQLLVTVIRGYLILLA 
CLVDSLLNSSVLDVGVADPSIGKE
WGDGGVMEIPEHCGDALETRLIRRNECLCHCSLS 
STVTLIPWREAQVSDVLRAKGVNL 
HVEAFVSMQLIKDLMAESSVIPRGS" 
/locus-tag="An14g05560"



complement(join(107592..107677,
107764..107819,
107918..107986,
108097..108183,
108460..108467))

complement(107589..107677)

complement(107678..107763)

complement(107764..107819)

complement(107820..107917)

complement(107918..107986)

complement(107987..108096)



/inference="protein motif:SignalP:2.0"
/locus-tag="Anl4g05560" 



/product="hypothetical protein" 
/locus-tag="Anl4g05560" 



/number=l 

/locus-tag="Anl4g05560" 



/number=l 

/locus-tag="An14g05560"



/number=2 

/locus-tag="An14g05560"



/number=2 

/locus-tag="An14g05560"



/number=3 

/locus-tag="Anl4g05560" 



gene 
mRNA 



complement(108097..108183)

complement(108184..108459)

complement(108460..108535)

complement(108536..108573)

complement(108574..108580)

complement(<109706..>111182)

complement(join(<109706..110389,
110813..110968,
111117..>111182))
complement(join(109706..110389,
110813..110968,
111117..111182))



/number=3 

/locus-tag="Anl4g05560" 



complement(111120..111182)

complement(join(109709..110389,
110813..110968,
111117..111119))

complement(109706..110389)

complement(110390..110812)



/number=4

/locus-tag="Anl4g05560" 



/number=4

/locus-tag="An14g05560"



/number=5 

/locus-tag="An14g05560"



/number=5 

/locus-tag="Anl4g05560" 



/number=6 

/locus-tag="Anl4g05570" 



/locus-tag="Anl4g05570" 



/locus-tag="Anl4g05570" 



/note="unnamed protein product; 

Title: similarity to EST an-3478 - 

Aspergillus niger" 

/codon-start=1

/protein-id="CAK42118.1"

/db-xref="GI:134081863"

/translation="MDRIRRDVMLSLSVSLSGTG 

GRKPNPTPRIHRIIRFISTGNHTH 

TLRYDGRTIVGTPPSFLRMDPRQPRQSEGPRIGI 

TRYCTVCAFFIFILVLLLLCPSCP 

SRLPLCSSPTPCPLSVRTEDISASGYPDLLRFTV 

DSAPSYSLNFHHEPTSPTRLGLLF 

PPPTISRTKFPDPPRDRRRSKGNKKKKTLKPHSS 

QQEKRIDNHHKLPSTKSLINRKMR 

DMSSKFIEILDPNDTRPRMSDCDVRLEDVLADHE 

ALVTRPRSSTQSSSKPSLDNRLDR 

DAPTSPTQRWKRLSTILVPARRGST"

/locus-tag="Anl4g05570" 

/inference="protein motif:SignalP:2.0"
/locus-tag="Anl4g05570" 



/product="unnamed" 
/locus-tag="Anl4g05570" 



/number=l 

/locus-tag="Anl4g05570" 



complement(110813..110968)



complement (1109 
16) 



/number=l 

/locus-tag="Anl4g05570" 



/number=2 

/locus-tag="Anl4g05570" 



exon 

intron 

exon 



complement(111117..111182)

<111686..>112508
join(<111686..112171,
112218..>112508)
join(111686..112171,
112218..112508)

111686..112171

112172..112217

112218..112508

<113124..>114242
join(<113124..113934,
113983..>114242)
join(113124..113934,
113983..114242)



exon 
intron 



113124. .113934 
113935. .113982 



/locus-tag="An14g05570"
/number=3 

/locus-tag="Anl4g05580" 
/locus-tag="Anl4g05580" 

/locus-tag="Anl4g05580" 

/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42119.1"

/db-xref="GI:134081864"

/translation="MPTKRARKNPSVAQKPPSPP 

LSEKNAVITQDIPEEEGAKRESTL 

RDESYTPELFDEDEEMPEETENTIDPDTGKAAKR 

TAEYALGGGGYNALKSYKGQMYSG 

MAVGGSHTWTYDQGTWKETKEEPDLWRIDYQTNK 

RRARKAPTGSGAPVGTEYHWLIVG 

HQHVKKIDANTYETHLTGSKYKLAYKSASSNAWS 

IPTVKKQREREVELLDDAKQRVQG 

LPPVLASEKVKVEKREKGQQRLDSMFGKAAGVKR 

KADENA" 

/locus-tag="Anl4g05580" 
/number=l 

/locus-tag="Anl4g05580" 
/number=l 

/locus-tag="Anl4g05580" 
/number=2 

/locus-tag="Anl4g05590" 
/locus-tag="Anl4g05590" 

/locus-tag="Anl4g05590" 

/inference="profile:COGS:COG3145"
/inference="profile:PFAM:PF03171"
/note="unnamed protein product;
Remark: similarity to human
sequence 203 from patent
WO0129221-A/203. Remark: this ORF
represents a putative DNA repair
protein. Title: similarity to
sequence 203 from patent
WO0129221-A - Homo sapiens"
/codon-start=1
/protein-id="CAK42120.1"
/db-xref="GI:134081865"
/translation="MSKRISTFFQPLPSKKLKPD 
ATTTAQSSSSSTTTVPPKPSTYTH 
HPSYPIPIRDPPSTIATPLLSNPAPRPPKALTNH 
PHLDLLYFQPFLPSPLARDLFQFL 
RNELPFYRVRYTIRRGGTETSINTPRYTTVFGVD 
DTSTFVIENSESTSSNSETSVQDD 
TTNNLLDPPPANSPILVDSKTRTPLTESKSKSKY 
RCRPRPIPPCLDILRQAVEKATDD 
GTRYNFVLVNYYATGDDSISYHSDDERFLGQNPT
IASLSLGAGRDFLLKHKPAAKPLK
FPLKSGDMLIMRGETQSNWLHSVPKRKGLQGSAG
ALGRINITFRRAWPGGTENYYRY
NVGDGEVYRWRDEEGKMVATTE"
/locus-tag="Anl4g05590" 
/number=l 

/locus-tag="Anl4g05590" 



exon 
gene 
mRNA 

CDS 



113983. .114242 



gene 
mRNA 



/number=l 

/locus-tag="Anl4g05590" 
/number=2 

/locus-tag="Anl4g05600" 



complement(<114314..>115364)

complement(join(<114314..114481,
114528..>115364))
/locus-tag="An14g05600"

complement(join(114314..114481,
114528..115364))
/locus-tag="An14g05600"



complement(114314..114481)

complement(114482..114527)

complement(114528..115364)

complement(<115782..>117895)

complement(join(<115782..115925,
116023..116208,
116389..116437,
116929..117066,
117455..117654,
117890..>117895))
complement(join(115782..115925,
116023..116208,
116389..116437,
116929..117066,
117455..117654,
117890..117895))



/inference="profile:COGS:COG0439"

/inference="similar to AA

sequence:PIR:B70347"

/note="unnamed protein product; 

Title: similarity to hypothetical 

D-alanine-D-alanine ligase B - 

Aquifex aeolicus" 

/codon-start=1

/protein-id="CAK42121.1"

/db-xref="GI:134081866"

/db-xref="GOA:A2R3V3"

/translation="MLGYQDSGSDIAYTLRNRGV 

QWTPVSSPSPKEQEGWCFPDTEN 

GILSAVRQGATHLWANTILFSSHPLQTSRSLASI 

PISWGQPPTLVESFDDKAYLNDQ 

LRIHGSFTLPRSWLISNSDNISPILDSIPCYPW 

GKPVRGRGSHGVKVCHDRAELEEH 

ISALLLESPIVMVEEFLTGEEATITVMPPSREGP 

EYWSLPPITRFNHVDGIAPYNGTV 

AVTANSRAVTESEMQDESYRSAMRQCEQVAKLIG 

ATAPIRIDIRRFSEGSQFAIFDIN 

MKPNMTGPGRPGREDQASLTALAAAAIGWDYGML

LHKILSSAQPLDRFRAYSNPFASH" 

/locus-tag="Anl4g05600" 

/number=l 

/locus-tag="Anl4g05600" 
/number=1

/locus-tag="An14g05600"
/number=2 

/locus-tag="Anl4g05610" 
/locus-tag="Anl4g05610" 



/locus-tag="Anl4g05610" 



/note="unnamed protein product; 
Title: questionable ORF" 
/codon-start=1
/protein-id="CAK42122.1"



complement(115782..115925)

complement(115926..116022)



/db-xref="GI:134081867"

/translation="MDWEITDRLWANALGLRVP 

SALTVQRSALSRVSVERLAQDSSQ 

RIFSSAWRKCGHLRLAMSAIGKSVSHYSDVIRHK 

HAVIAHVIMLAPNRSVFSLIPEFL 

VPDILSIISLAPRMGCTPESMDPVTGVCLVEAAL 

RTTRHCQTAVARPLKYDIPLSLSR 

FKLAWLFTGRPILCDYGTLIDEESMLFISDITSV 

SDTGRNDYAFYFMIHHPLCQFATG 

HATFKGPVSETFWQLLLTMHR" 

/locus-tag="Anl4g05610" 

/number=l 

/locus-tag="Anl4g05610" 



complement(116023..116208)

complement(116209..116388)

complement(116389..116437)

complement(116438..116928)

complement(116929..117066)

complement(117067..117454)

complement(117455..117654)

complement(117655..117889)

complement(117890..117895)

<117920..>118997
join(<117920..118177,
118295..118597,
118734..>118997)
join(117920..118177,
118295..118597,
118734..118997)



/number=l 

/locus-tag="Anl4g05610" 
/number=2 

/locus-tag="Anl4g05610" 
/number=2 

/locus-tag="Anl4g05610" 
/number=3 

/locus-tag="Anl4g05610" 
/number=3 

/locus-tag="Anl4g05610" 
/number=4 

/locus-tag="Anl4g05610" 
/number=4 

/locus-tag="Anl4g05610" 
/number=5

/locus-tag="Anl4g05610" 
/number=5 

/locus-tag="Anl4g05610" 

/number=6

/locus-tag="An14g05620"
/locus-tag="An14g05620"



/locus-tag="Anl4g05620" 



/note="unnamed protein product; 
Title: similarity to EST an-2924 - 
Aspergillus niger" 
/codon-start=1
/protein-id="CAK42123.1"
/db-xref="GI:134081868"
/translation="MEHPISLCQRCRVLQFNEQD 
YGTVENESLGNCTWFDHFDDEDY 
YEDDPFYEDDQDIRRQEISLDYQLLDKFPDFWFL 
RHPLHLDVIVSIVCNGSKACIRDK 
GNTVSRNLDRGILYQPSLATPRHEIIGSLRTPLD 
DEESIAPSWSWVSNPNPVKWGGET 



exon 

intron 

exon 

intron 

exon 

gene 

mRNA 



117920..118177
118178..118294

118295..118597
118598..118733
118734..118997

complement(<119272..>121189)

complement(join(<119272..119830,
119883..120299,
120386..120822,
120879..121016,
121097..>121189))
complement(join(119272..119830,
119883..120299,
120386..120822,
120879..121016,
121097..121189))



NNFDPLFMCQSKIASINMKTNLNPYGQYVADVRF 

DWTHDSSEAQTHSTEKEKDILNML 

SMVLILRYDGEEEDFMMGLLVLPTNMPSVYRRSG 

LFSCENGQEFWYNIDTPIVALV" 

/locus-tag="Anl4g05620" 

/number=1

/locus-tag="Anl4g05620" 
/number=l 

/locus-tag="An14g05620"
/number=2

/locus-tag="An14g05620"
/number=2 

/locus-tag="Anl4g05620" 
/number=3 

/locus-tag="Anl4g05630" 



/locus-tag="Anl4 



/locus-tag="Anl4 



/EC-number="6.2.1.12"
/inference="profile:COGS:COG0318"
/inference="profile:PFAM:PF00501"
/note="unnamed protein product; 
Catalytic activity: ATP + 
4-coumarate + CoA = AMP + 
pyrophosphate + 4-coumaroyl-CoA. 
Pathway: flavonoids, stilbene and 
lignin biosynthesis. Remark: the 
phenylpropanoid enzyme 
4-coumarate:coenzyme A ligase
(4CL) plays a key role in general 
phenylpropanoid metabolism. 4CL is 
related to a larger class of 
prokaryotic and eukaryotic 
adenylate-forming enzymes and
shares several conserved peptide 
motifs with these enzymes. 
Similarity: belongs to the 
Acyl-CoA synthetases 
(AMP-forming)/AMP-acid ligases II.
Title: strong similarity to 
4-coumarate-CoA ligase 3 4CL3 - 
Populus balsamifera" 
/citation=[83] 
/codon-start=1
/protein-id="CAK42124.1"
/db-xref="GI:134081869"
/db-xref="GOA:A2R3V6" 
/translation="MIFEPAARAPVPCTDVISYV 
FSDPPYDHDEPIYVDVHNPSRSIS 
YNQARTIVRQLVSGLRAWGVQKGDCVAIHSFNDI 
YYTMLVLAIVGAGGIFTGSNPAYT 
AFELGHHFRSSATNFVITEPEHLDAITTAAKAAS 
IPEHRIRVFDVLGQSVPDGAVSWT 



gene 
mRNA 



complement(119272..119830)

complement(119831..119882)

complement(119883..120299)

complement(120300..120385)

complement(120386..120822)

complement(120823..120878)

complement(120879..121016)

complement(121017..121096)

complement(121097..121189)

complement(<121271..>122416)

complement(join(<121271..121409,
121457..121657,
121705..121797,
121861..122051,
122150..122275,
122396..>122416))
complement(join(121271..121409,
121457..121657,
121705..121797,
121861..122051,
122150..122275,
122396..122416))



ELLEHGEQDWVRFDDENLSRTTTAARLFSSGTTG 

LPKAAEITHMNLVAQHEFVFEFNP

RPWRWIAVPLFHAAAAPSSHFGSLKAGHINYVM 

RRFDLPLFLQTVEKYQVTEMAIVP 

PIAIAIIMHPMSYERGYLRSIRASNLGAAPMDKD 

AQKRFQRLLGPGANCTQVWGMTET 

CCIATMFRWDEGDETGSVGRLVPNMEAKLVDDNG

TDISDYGVRGELCVRGPAVTPGYF

NNPVANAESFDEQGWFHTGDIAYCDCATQKWYIV 

DRKKELIKVRGFQVAPPELEAVLL 

AHPLIVDAAVIGLRDWPGTELPRAYWRRPETD 

ESKLTEDMVKSWLLERLAGYKALT 

GGVKFVPSIPKTASGKILKRVLREESRREVKESG

WRPRL" 

/locus-tag="Anl4g05630" 

/number=1

/locus-tag="Anl4g05630" 



/number=l 

/locus-tag="Anl4g05630" 



/number=2 

/locus-tag="Anl4g05630" 



/number=2 

/locus-tag="Anl4g05630" 



/number=3 
/locus-tag="Anl4 



/number=3 
/locus-tag="Anl4 



/number=4 
/locus-tag="Anl4 



/number=4 
/locus-tag="Anl4 



/number=5 

/locus-tag="Anl4g05640" 



/locus-tag="Anl4 



/locus-tag="Anl4g05640" 



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42125.1"

/db-xref="GI:134081870"

/translation="MAVAAVSGRREPPSAMKMTI

QEVGMFEQINLVPVIPGASSVIPV 



complement(121271..121409)

complement(121410..121456)

complement(121457..121657)

complement(121658..121704)

complement(121705..121797)



MAIFTADRIADKRMVCDTQVRSIAVIADRIRADL 

EDSTSVRTISTVLRSLSGLVLPLS 

RPTKYESLVLGGAENEGGSEVAGSAVIGDKHGPR 

TQTATPRPQLPNAAWRVGGLPAGG 

GLSARLAAIGLSDPFPTTRNTKTRAPPARPVHAN 

SSIFSGLQVWRSLNGLPHTMPAMC 

PALESVPLPAPGDVPCWPSRQNDKPYFSIGAGFI 

QLKL" 

/locus-tag="Anl4g05640" 
/number=l 

/locus-tag="Anl4g05640" 
/number=l 

/locus-tag="Anl4g05640" 

/number=2 

/locus-tag="Anl4g05640" 
/number=2 

/locus-tag="Anl4g05640" 



complement(121798..121860)

complement(121861..122051)

complement(122052..122149)

complement(122150..122275)

complement(122276..122395)

complement(122396..122416)



/number=3 

/locus-tag="Anl4g05640" 



/number=3 

/locus-tag="Anl4g05640" 



/number=4 
/locus-tag="Anl4 



/number=4 
/locus-tag="Anl4 



/number=5 
/locus-tag="Anl4 



/number=5 
/locus-tag="Anl4 



<122711..>127337
join(<122711..122827,
122948..123005,
123087..123243,
123306..123409,
123871..124023,
124127..124263,
124490..124605,
124682..124801,
124889..124955,
125041..125228,
125358..125490,
125911..126029,
126093..126179,
126273..126578,
126625..126770,
126836..127004,
127202..>127337)
join(122711..122827,
122948..123005,
123087..123243,



/number=6 

/locus-tag="Anl4g05650" 
/locus-tag="Anl4g05650" 



/locus-tag="Anl4 



exon 

intron 

exon 

intron 

exon 

intron 

intron 
exon 



123306..123409,
123871..124023,
124127..124263,
124490..124605,
124682..124801,
124889..124955,
125041..125228,
125358..125490,
125911..126029,
126093..126179,
126273..126578,
126625..126770,
126836..127004,
127202..127337)



122711..122827
122828..122947
122948..123005
123006..123086
123087..123243
123244..123305
123306..123409
123410..123870
123871..124023



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42126.1"

/db-xref="GI:134081871"

/translation="MTIHFLNGKIILTAPDSRMN 

LKNSRDSHDDVDSLCAMSQHPWCR 

YRPWSPLGLGHNLDVPIFPRWRRTSSRIFFEMAR 

PHQPTAKPSGALHNAADRLVPFCT 

CLLFAYNLTASCQNAIAGAGSLCDVGASFMIEWA 

NIAEYHIGFVPISNIQGVRSVLQG 

SPQHRENEMPMSEVPNPKVLWLNPARRRVFHTFN 

: ALYMLGV 

AIF-^ LEIIEDL^T F ^KAMPLLTRMSGMFAKTP 

PTSWVGMTGQRCCGHLLAGGSGLG 

GKTVRIACKPDVSLAPFFHFVATRRVIRADGSAR 

AGYFCYSSRVRYDSGAQTPWEVK 

LVSSLIWRQTEYSMASLLGGSRAGEGSKGKRRRN 

VTNWNQSTLSLACPEQVNQAVRLL 

RDPVTETLYCCLLRKTADECFVCSREPSSGHGCS 

TPLHNTRISRASNTCRDQPLWRLM

NTGDQVIRLNGWWHGRPITHKWKISTVREVGNGT 

IDRGGRATEIIMVSTSFDHFHPGC 

LTCKSRVKACRWFGGHLWTLSDFRGAFQLPIS 

HYYITTSSYIAQEPIPPPALPLLL 

HPVLLALPSLPNMVALSTLYNTQYLGHSTHYHSL 

FSPSHSRPATTWRVSLSYGSSIRW 

QSTNQIESEDRRRVSERHILFFTAGFPSFLADGV 

HDSWLYSPEAPDSAGVPEWTGVGA 

FGWRPVAYAGLCVSGLSGVRRISPSPSPYRLAIP

PFPLSSTRPQIYLVSVDDAGFGSD 

LDGSIAQYPHRRLRIYFQEARSTVAQAILA" 

/locus-tag="Anl4g05650" 

/number=l 

/locus-tag="Anl4g05650" 
/number=l 

/locus-tag="Anl4g05650" 
/number=2 

/locus-tag="Anl4g05650" 

/number=2

/locus-tag="Anl4g05650" 
/number=3 

/locus-tag="Anl4g05650" 
/number=3 
/locus-tag="An14g05650"
/number=4
/locus-tag="An14g05650"
/number=4
/locus-tag="An14g05650"









/number=5 






intron 124024..124126
/locus-tag="An14g05650"
/number=5

exon 124127..124263
/locus-tag="An14g05650"
/number=6

intron 124264..124489
/locus-tag="An14g05650"
/number=6

exon 124490..124605
/locus-tag="An14g05650"
/number=7

intron 124606..124681
/locus-tag="An14g05650"
/number=7

exon 124682..124801
/locus-tag="An14g05650"
/number=8

intron 124802..124888
/locus-tag="An14g05650"
/number=8

exon 124889..124955
/locus-tag="An14g05650"
/number=9

intron 124956..125040
/locus-tag="An14g05650"
/number=9

exon 125041..125228
/locus-tag="An14g05650"
/number=10

intron 125229..125357
/locus-tag="An14g05650"
/number=10

exon 125358..125490
/locus-tag="An14g05650"
/number=11

intron 125491..125910
/locus-tag="An14g05650"
/number=11

exon 125911..126029
/locus-tag="An14g05650"
/number=12

intron 126030..126092
/locus-tag="An14g05650"
/number=12

exon 126093..126179
/locus-tag="An14g05650"
/number=13

intron 126180..126272
/locus-tag="An14g05650"
/number=13

exon 126273..126578
/locus-tag="An14g05650"
/number=14

intron 126579..126624
/locus-tag="An14g05650"
/number=14

exon 126625..126770
/locus-tag="An14g05650"
/number=15

intron 126771..126835
/locus-tag="An14g05650"
/number=15

exon 126836..127004
/locus-tag="An14g05650"
/number=16

intron 127005..127201
/locus-tag="An14g05650"
/number=16

exon 127202..127337
/locus-tag="An14g05650"
/number=17

gene <127622..>128417
/locus-tag="An14g05660"

mRNA join(<127622..127779,
128125..128227,
128313..>128417)
/locus-tag="An14g05660"

CDS join(127622..127779,
128125..128227,
128313..128417)
/locus-tag="An14g05660"









/note="unnamed protein product;
Title: questionable ORF"
/codon-start=1
/protein-id="CAK42127.1"
/db-xref="GI:134081872"
/translation="MPRELGCGEVYGMGDMPTVR 



exon 

intron 

exon 

intron 

exon 



127622..127779

127780..128124

128125..128227

128228..128312

128313..128417

<128987..>130729
join(<128987..130216,
130286..>130729)
join(128987..130216,
130286..130729)



QPDGIRVGALSPLISGMAIRVGGR

GIMPTGKRTDAGQLQIKPCRSSYSPPFVEQLSQQ

ELELPSLGRAERGIFTPLESGQMY

QNKRIRILPFASRSVDRAA"

/locus-tag="An14g05660"

/number=1

/locus-tag="An14g05660"
/number=1

/locus-tag="An14g05660"
/number=2

/locus-tag="An14g05660"
/number=2

/locus-tag="An14g05660"
/number=3

/locus-tag="An14g05670"

/locus-tag="An14g05670"



exon 
intron 



128987..130216
130217..130285



/locus-tag="Anl4g05670" 



/inference="profile:PFAM:PF00172"

/inference="similar to AA

sequence:UniProtKB:AB035491.1"

/note="unnamed protein product; 

Remark: the Japanese pear 

pathotype of Alternaria alternata

produces host-specific AK-toxin 

and causes black spot of Japanese 

pear. Transformation of the wild 

type with AKTR-1- and 

AKT3-1-targeting vectors produced

toxin-deficient (Tox-),

nonpathogenic mutants. Title: 

similarity to AK-toxin regulating 

protein AktR-1 - Alternaria 

alternata" 

/citation=[74]

/codon-start=1

/protein-id="CAK42128.1"

/db-xref="GI:134081873"

/translation="MHTCVVNKIIVDLRDRHRRR 

CIKNIGQERQSKRKSCDACAQKKL

RCSMTRPSCSRCIQSRRPCVYPQSSIPVQAPNLD 

DAQDNIVTSSGSLHSVSVCAGIIP

GGTTWALPTYPFDTPSTADEVNDVSGSSWSPETP 

TNAELAFQTVNESTMFPMQDVSLM 

NSPPWQEDLHERPETLGLEEFFSGSLDSCSPAMY 

IPPTMGSSPVTMVPPTPATLPGDS 

YATGGGLSMSSVSPGYFDNFWPDNMASTDEDHET 

WRFGTYTDSRSGSGFGQNFLPKTS 

HHSPGDIGDLYQELFSLLREYPGLALQRQFYSPF 

LHHELQGYAMQSLGQPISTTLSSI 

SSYASYLEACDTFDLSMHDGERLAAAEGCWALH 

AVCVQQILSIFGDNFATTGFSKLP 

NFAGDQGHTETPVDILLRMTQRVYKLHEDVLRTP 

HEDETDWRRWKFAESLRRNIFFAN

IIRTLAGRARRYNGINIDPLDSAILLQLPLPAPE 

EMWRARSEGEWMIARAQTQRSWAR 

GVDGSMLPALRTLQQLLVLEEVGSVSVTLLLPIT 

RMILACAKLNGASYG"

/locus-tag="Anl4g05670" 

/number=l 

/locus-tag="Anl4g05670" 



exon 

intron 

exon 

intron 

exon 

intron 

exon 



intron 
exon 



130286..130729

<131085..>131662
join(<131085..131323,
131388..131502,
131555..131602,
131645..>131662)
join(131085..131323,
131388..131502,
131555..131602,
131645..131662)

131085..131323

131324..131387

131388..131502

131503..131554

131555..131602

131603..131644

131645..131662

<134201..>135346
join(<134201..134238,
134452..134502,
134768..134993,
135113..>135346)
join(134201..134238,
134452..134502,
134768..134993,
135113..135346)



134201. .134238 
134239. .134451 
134452. .134502 



/number=l 

/locus-tag="Anl4g05670" 
/number=2 

/locus-tag="Anl4g05680" 
/locus-tag="Anl4g05680" 



/locus-tag="Anl4 



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42129.1"

/db-xref="GI:134081874"

/translation="MHGSKRDRLKSAKQKLLGAC 

MSENWTWGWAWLAETPKSGHTAMP 

RPRLTTNMQAVRQAGRQADRLEDEESRPDGEQGP

GSLLAFPRMVTSGGGPASTRGIYL

GGFFVPLVSIAAISSEDELVNISWWDWSRRVPL

FSY" 

/locus-tag="Anl4g05680" 
/number=l 

/locus-tag="Anl4g05680" 
/number=l 

/locus-tag="Anl4g05680" 
/number=2 

/locus-tag="Anl4g05680" 
/number=2 

/locus-tag="Anl4g05680" 
/number=3 

/locus-tag="An14g05680"
/number=3 

/locus-tag="Anl4g05680" 

/number=4 

/locus-tag="An14g05690"
/locus-tag="Anl4g05690" 



/locus-tag="Anl4g05690" 



/note="unnamed protein product; 

Title: questionable ORF" 

/codon-start=1

/protein-id="CAK42130.1"

/db-xref="GI:134081875"

/translation="MDMNPVGSNFSRGSSGDIPR 

LGYRVGAFKRLYLPGWKSIHPPI 

YYEDLPNVADDYLEGASFPHHKPSHVCPESFYNL 

FVQYKLDETGRKEGWRPMDRDKNQ 

IRADGSATCYECVRDNRGYDSLGKISDSLSLMEK 

RIVDPFDRNTPRGVEIIIEGKAML

FIGTFISLFHILITLYDSTFAS" 

/locus-tag="Anl4g05690" 

/number=1

/locus-tag="Anl4g05690" 
/number=l 

/locus-tag="Anl4g05690" 









/number=2 


intron 134503..134767
/locus-tag="An14g05690"
/number=2

exon 134768..134993
/locus-tag="An14g05690"
/number=3

intron 134994..135112
/locus-tag="An14g05690"
/number=3

exon 135113..135346
/locus-tag="An14g05690"
/number=4



gene 
mRNA 



complement(<135466..>136476)

complement(join(<135466..135580,
135676..135753,
136059..136189,
136290..136364,
136405..>136476))
complement(join(135466..135580,
135676..135753,
136059..136189,
136290..136364,
136405..136476))

complement(135466..135580)

complement(135581..135675)

complement(135676..135753)

complement(135754..136058)

complement(136059..136189)

complement(136190..136289)

complement(136290..136364)

complement(136365..136404)

complement(136405..136476)



/locus-tag="Anl4g05700" 
/locus-tag="Anl4g05700" 



/locus-tag="Anl4g05700" 



/note="unnamed protein product; 

Title: strong similarity to 

hypothetical protein encoded by 

An01g10130 - Aspergillus niger"

/codon-start=1

/protein-id="CAK42131.1"

/db-xref="GI:134081876"

/translation="MPMKDSRQLTCRLNSWIRLQ 

LEYRAGIYYSDTLTRKKPIVPSGT 

KPRCNRYLVQHYLYGIKRSCPARNSGSKCSGCTD 

AHDFMDARMENAKELQKVLDSCPS 

RIEGSWIIPDIHTSAMSATEDDSGVMTGLGVLSG 

CTLAAFKDRANWNDDDGIPG" 

/locus-tag="Anl4g05700" 

/number=l 

/locus-tag="Anl4g05700" 
/number=l 

/locus-tag="An14g05700"
/number=2

/locus-tag="An14g05700"
/number=2 

/locus-tag="Anl4g05700" 
/number=3 

/locus-tag="Anl4g05700" 
/number=3 

/locus-tag="Anl4g05700" 
/number=4 

/locus-tag="Anl4g05700" 
/number=4 

/locus-tag="Anl4g05700" 



gene 
mRNA 



complement(<137703..>138530)

complement(join(<137703..137767,
137825..137999,
138078..138298,
138437..>138530))
complement(join(137703..137767,
137825..137999,
138078..138298,
138437..138530))



/locus-tag="An14g05710"
/locus-tag="Anl4g05710" 



/locus-tag="Anl4g05710" 



complement(137703..137767)

complement(137768..137824)

complement(137825..137999)

complement(138000..138077)

complement(138078..138298)

complement(138299..138436)

complement(138437..138530)

<139122..>140458
join(<139122..139252,
139343..139427,
139649..139772,
139813..139963,
140010..140120,
140192..140244,



/inference="profile:COGS:COG0553"
/note="unnamed protein product;
Similarity: although the
similarity to Rhp16 of S. pombe is
weak, some motifs in the predicted
ORF suggest that it might be
involved in DNA binding and
repair. Title: weak similarity to
RAD16 nucleotide excision repair
protein homolog rhp16p -
Schizosaccharomyces pombe"
/citation=[32]
/citation=[61]
/codon-start=1
/protein-id="CAK42132.1"
/db-xref="GI:134081877"
/translation="MARYDFYMMSAMFLGSWHTG 
SWPRQCEHLGAAGKIKRKRAGPCN 
IRDINQPKIKGMRVSTILRFWQPVAIYAIKEFSK 
NPMLCCCVLADVMGRGKTWTLTGY 
LVDHATYEPDKPTLIICPPHLVWQWASEIKKFTS 
KLKILVYFGDAREEPPVPVQALKT 
LSSRISDRLHLTQGDIILDTAWWS" 
/locus-tag="Anl4g05710" 

/number=1

/locus-tag="An14g05710"
/number=1

/locus-tag="An14g05710"
/number=2 

/locus-tag="Anl4g05710" 
/number=2 

/locus-tag="Anl4g05710" 
/number=3 

/locus-tag="Anl4g05710" 
/number=3 

/locus-tag="Anl4g05710" 
/number=4 

/locus-tag="Anl4g05720" 
/locus-tag="Anl4g05720" 



exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

intron 

exon 

gene 

mRNA 



140337..>140458)
join(139122..139252,
139343..139427,
139649..139772,
139813..139963,
140010..140120,
140192..140244,
140337..140458)



139122. .139252 

139253. .139342 

139343. .139427 

139428. .139648 

139649. .139772 

139773. .139812 

139813. .139963 

139964. .140009 

140010. .140120 

140121. .140191 

140192. .140244 

140245. .140336 

140337. .140458 

complement(<140647..>142006)

complement(join(<140647..141004,
141055..141527,
141590..141657,
141707..141743,
141796..141815,
141865..141876,
141952..>142006))
complement(join(140647..141004,
141055..141527,
141590..141657,
141707..141743,



/locus-tag="Anl4g05720" 



/codon-start=l 

/product="hypothetical protein" 

/protein-id="CAK42133 .1" 

/db-xref="GI : 134081878" 

/translation="MPYFAVKGKGNSFVDESQGL 

APVPSAYRSDKCDMMHAILDLNQR 

TCSTHQPRPSTRAVRVPSWFHRTSHEAVEYNITV

TAATSTVSGSFRSLGAHPILILRQ

YDALQGHILYDRGSGQLTSMQACYSRARRHPTSP 

IDNHHTAHNEEKGDITLPIDDIFN 

YQNTLYLHPLTLVLIGVARHYGENSGQVFLRRQF 

HRARKERKWDDGEKSRKMGWMWEE 

VELVSWRNGMEGFRGESAKSMYALARWIIRLHRF 

MARHLY" 

/locus-tag="An14g05720"
/number=1
/locus-tag="An14g05720"
/number=1
/locus-tag="An14g05720"
/number=2
/locus-tag="An14g05720"
/number=2
/locus-tag="An14g05720"
/number=3

/locus-tag="An14g05720"
/number=3

/locus-tag="An14g05720"
/number=4
/locus-tag="An14g05720"
/number=4

/locus-tag="An14g05720"
/number=5
/locus-tag="Anl4g05720" 
/number=5 

/locus-tag="An14g05720"
/number=6 

/locus-tag="Anl4g05720" 
/number=6 

/locus-tag="Anl4g05720" 
/number=7 

/locus-tag="Anl4g05730" 
/locus-tag="Anl4g05730" 



/locus-tag="Anl4g05730" 



141796..141815,
141865..141876,
141952..142006))



complement ( 140647 , 
04) 



complement ( 141005 . 
54) 



complement ( 141055 . 
27) 



complement (141528. .1415 



/inference="similar to AA
sequence:UniProtKB:AF119672.1"
/note="unnamed protein product;
Function: PTH11 of M. grisea is
required for appressorium
differentiation in response to
inductive surface cues. Function:
as do most fungal plant pathogens,
M. grisea differentiates an
infection structure specialized
for host penetration called the
appressorium. Phenotype: M. grisea
cells null mutant for PTH11 are no
more pathogenic. Similarity: the
length of the predicted ORF is
only one half of the M. grisea
PTH11. Title: similarity to
integral membrane protein PTH11 -
Magnaporthe grisea"
/citation=[63]
/codon-start=1
/protein-id="CAK42134.1"
/db-xref="GI:134081879"
/translation="MTVSHPKDVLNTINFVTQCV 
EDWSCLTAWVLFMGYCGIAIAVGH 
YGGGYHIDDVSEAHQVLFRKFCYIATVLYCPMAL
FVKIALLSILIRIFNPYRSKIYFI 
YGLLGCLCIYYIVAEIVKIRMCDPVPAYWTQDPK 
ARCLNQRAALIADSVISWTDFII 
LILPLPLTWSLQMSRNKKLRVIGMLSAGGLATAF 
SLYRLVLVLRDGSSRDQTIVFMIV
ILSGNAEGGVAMICACLPTVNILINKLRKKEYSS 
QRYYEPESSVNLSKLKGSNKRFSL 
GNSRRADGTESASDQSHLITFAGTVDMAGGNDTG 
GIHKTVDVSQTIEMVADGDSHESHHHGSFH"
.1410 /locus-tag="Anl4g05730" 

/number=l 
.1410 /locus-tag="Anl4g05730" 

/number=l 
.1415 /locus-tag="Anl4g05730" 



complement ( 141590 . 
57) 



complement ( 141658 . 
06) 



complement ( 141707 . 
43) 



complement ( 141744 . 
95) 



/number=2 

/locus-tag="Anl4g05730" 



/number=2 

/locus-tag="Anl4g05730" 



/number=3 

/locus-tag="Anl4g05730" 



/number=3 

/locus-tag="Anl4g05730" 



/number=4 

/locus-tag="Anl4g05730" 



gene 
mRNA 



complement (141796. .1418 
15) 

complement (141816. .1418 
64) 

complement (141865. .1418 
76) 

complement (141877. .1419 
51) 

complement (141952. .1420 
06) 

complement(<142716..>143896)

complement(join(<142716..142868,
142941..143171,
143244..143322,
143423..143759,
143845..>143896))
complement(join(142716..142868,
142941..143171,
143244..143322,
143423..143759,
143845..143896))



/locus-tag="An14g05730"
/number=5

/locus-tag="Anl4g05730" 
/number=5 

/locus-tag="Anl4g05730" 
/number=6 

/locus-tag="Anl4g05730" 
/number=6 

/locus-tag="Anl4g05730" 

/number=7 

/locus-tag="Anl4g05740" 
/locus-tag="Anl4g05740" 



/locus-tag="Anl4g05740" 



complement(142716..142868)

complement(142869..142940)

complement(142941..143171)

complement(143172..143243)

complement(143244..143322)

complement(143323..143422)

complement(143423..143759)



/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42135.1"

/db-xref="GI:134081880"

/translation="MEEELSGLTRAWRQQTDPL 

PNEGDRWSLTGCWNPTRVSRTREI 

GWEGRAEPWEVEVYPRSLHRTTDYTQLQFSHRLP 

RVSFFPVDPLYNMQRVANRVGFKP 

FCSLADRAQGRRLLEAQESFNSSHLRQRDGSHPT 

LHSTHSKRKSKFFIAALGVYNNPR 

QGQQRAGQACELRGSILPGLTGSMTVCRSPRHHR 

LFTDAASQPTFQRSPLCRIPYCGL 

ARTRRSLIPHSGPWRWRGAIGTSARGGSNNRRW 

LLLLVISIVRSRYFRRPAAMSPRLARRGGGI"

/locus-tag="Anl4g05740" 

/number=1

/locus-tag="Anl4g05740" 
/number=l 

/locus-tag="Anl4g05740" 
/number=2 

/locus-tag="Anl4g05740" 
/number=2 

/locus-tag="Anl4g05740" 
/number=3 

/locus-tag="An14g05740"
/number=3 

/locus-tag="Anl4g05740" 



intron

complement(143760..143844)

/number=4
/locus-tag="An14g05740"

exon

complement(143845..143896)

/number=4
/locus-tag="An14g05740"



gene 
mRNA 



<143978..>145355
join(<143978..144235,



/number=5 

/locus-tag="Anl4g05750" 
/locus-tag="Anl4g05750" 



144298..144933,
144984..>145355)



CDS 



join(143978..144235,



/locus-tag="Anl4g05750" 



144298..144933,
144984..145355)



/EC-number="1.3.1.42"
/inference="profile:COGS:COG1902"
/inference="profile:PFAM:PF00724"
/inference="similar to AA
sequence:UniProtKB:AF293653.1"
/note="unnamed protein product;
Catalytic activity: OPR3 of A.
thaliana catalyzes the reaction:
8-[(1R,2R)-3-Oxo-2-{(Z)-pent-2-enyl}cyclopentyl]octanoate
+ NADP+ =
(15Z)-12-Oxophyto-10,15-dienoate +
NADPH. Phenotype: A. thaliana OPR3
null mutants are male-sterile due
to defects in pollen release
caused by a delay in the stomium
degeneration program. Remark:
Jasmonic acid (JA) and its
precursor 12-oxophytodienoic acid
(OPDA) act as plant growth
regulators and mediate responses
to environmental cues. Title:
strong similarity to
12-oxo-phytodienoate reductase
OPR3 - Arabidopsis thaliana"
/citation=[71]
/citation=[75]
/codon-start=1
/protein-id="CAK42136.1"
/db-xref="GI:134081881"
/db-xref="GOA:A2R3W8"
/translation="MTIVQAQGADSKLFQPLAIA 
NGKLTLSHRWHAPLTRNRGEPLN 
SNSTPENPNRIWYPGDLWEYYRQRATPGGLIIS 
EGIPPSLESNGMPGVSGLFTEEQA 
AGWKRWDTVHAQGGYIYCQLWHAGRATVPQMTG
YPPVSASASVWDDPEERYTHPAVG 
DSEPVRYSDHPPIELTVAHIKQTIQDYCKAAKTA 
MDIGFDGVELHSGNGYLPEQFLSS 
NINRRTDDYGGTPEKRCRFVLELMDELAQTVGQE 
NLAIRLTPFGLYNQARGEQRVETW 
TYLCESLKQAHPHLSYVSFVEPRYEQIHSYEEKD 
AFLRSWGLSSVDLSSFRKIFGSTP 
FFSAGGWDQTNSWGVLEAGKYDALLYGRYFTSNP 
DLVERLRKGIPFAPYDRTRFYGPF 
EDSAFHYVDYEPAPQNSTGQESVNVSVRL"



exon 



intron 



144236. .144297 



143978. .144235 



/locus-tag="Anl4g05750" 
/number=l 

/locus-tag="Anl4g05750" 



exon 

intron 

gene 
mRNA 



144298. .144933 



144984. .145355 

complement(<145510..>146577)

complement(join(<145510..145699,
145751..145803,
145854..145876,
145933..146045,
146099..146459,
146529..>146577))
complement(join(145510..145699,
145751..145803,
145854..145876,
145933..146045,
146099..146459,
146529..146577))



/number=1

/locus-tag="An14g05750"
/number=2

/locus-tag="An14g05750"
/number=2

/locus-tag="An14g05750"
/number=3

/locus-tag="An14g05760"
/locus-tag="An14g05760"



/locus-tag="Anl4 



complement(145510..145699)



complement(145700..145750)



complement(145751..145803)



complement (145804. .1458 



/EC-number="1.1.1.245"
/inference="profile:COGS:COG1028"
/inference="profile:PFAM:PF00106"
/note="unnamed protein product; 
Catalytic activity: chnA of 
Acinetobacter catalyzes the 
reaction: Cyclohexanol + NAD+ <=> 
Cyclohexanone + NADH. Similarity: 
the gene model structure is 
confirmed by A. niger EST 
EMBLEST:BE759160, which covers 
exons 1 to 3. Title: strong 
similarity to cyclohexanol 
dehydrogenase chnA - Acinetobacter 
sp" 

/citation=[72] 

/codon-start=1

/protein-id="CAK42137.1"

/db-xref="GI:134081882"

/db-xref="GOA:A2R3W9"

/translation="MACKVSGTAFITGGGNGIGK 

TTAFALAQNGIEAVSLLDVNESLL 

QRTKDELATSHPQVAVELTVGDVSKEACVDEAVR

RTVERFGRIDISVHCAGIVGQPSA

THELTAAEWQRVIDINQTGVLLCQKAVIRQMLTQ

ESRGLRLGRGTIVNVASMFGWAP

GGWSGLSAYTASKHAWAFSKMDAKAYIQQEIRI 

NAICPGYTDTDMIRTYWDAGYMVP 

DSQRVAIGRRAQPEEIADAVLFLASPMSSYMVGS 

ALWDGGYTA" 

/locus-tag="An14g05760"
/number=1

/locus-tag="An14g05760"
/number=1

/locus-tag="An14g05760"
/number=2

/locus-tag="An14g05760"



gene 
mRNA 
CDS 



complement ( 145854 . 
76) 



complement ( 145877 . 
32) 



complement (145933 . 
45) 



complement ( 146046 . 
98) 



complement (146099. 
59) 



complement (146529 . . 
77) 

complement (<147404 . 

7784) 

complement (<147404. 
7784) 

complement (147404. . 
84) 



complement ( 147404 . . 

84) 

<149275. .>150765 
join(<149275. .15008 
150140 . . 150471, 
150521 . .>150765) 
join(149275. .150089 
150140. .150471, 
150521. .150765) 



/number=2 

1458 /locus-tag="Anl4g05760" 

/number=3 

1459 /locus-tag="Anl4g05760" 

/number=3 

1460 /locus-tag="Anl4g05760" 

/number=4 
1460 /locus-tag="Anl4g05760" 

/number=4 

1464 /locus-tag="Anl4g05760" 

/number=5 

1465 /locus-tag="Anl4g05760" 

/number=5 
1465 /locus-tag="Anl4g05760" 

/number=6 
.>14 /locus-tag="Anl4g05765" 

.>14 /locus-tag="Anl4g05765" 

1477 /locus-tag="Anl4g05765" 

/note="unnamed protein product;
Title: strong similarity to EST
an-3120 - Aspergillus niger"
/codon-start=1
/protein-id="CAK42138.1"
/db-xref="GI:134081883"
/translation="MSIGKIIGKIIIIPILVIIF 
ICVCIYFLIKHRRDRKRERREDNL 
RAQYYRQQFQQQYMYPHTQMQPQQQQQQPGTPAP 
PYYNHAYAVGQQPVAMNEGEYGYS 
ESMMKKPEPWYPVQQQHPGSEW" 
1477 /locus-tag="Anl4g05765" 

/ number=l 

/locus-tag="An14g05780"
/locus-tag="An14g05780"



/locus-tag="An14g05780"



/EC-number="l 

/inference="profile:PFAM:PF01222"
/inference="similar to AA
sequence:PIR:S64014"
/note="unnamed protein product;
Function: the S. cerevisiae ERG4
gene encodes sterol C-24(28)
reductase which catalyzes the
final step in the biosynthesis of
ergosterol. Remark: ergosterol is
the precursor of vitamin D2.
Title: strong similarity to sterol



exon 

intron 

exon 

intron 

exon 

gene 

mRNA 



149275. .150089 
150090. .150139 
150140. .150471 

150472 . . 150520 

150521. .150765 

complement (<151141 
3445) 

complement (join (<15: 
. .151564, 
151613 . . 151848, 
.152339, 
.152673, 
.153109, 
.153256, 
>153445) ) 



C-24 reductase Erg4 - 
Saccharomyces cerevisiae 
endoplasmatic reticulum" 
/citation=[6]
/citation=[18]
/citation=[22]
/citation=[54]
/citation=[66]
/codon-start=1
/protein-id="CAK42139.1"
/db-xref="GI:134081884"
/db-xref="GOA:A2R3X1"
/translation="MKQSKDRSPEGTSNAHEDEK 
PTFEFGGAPGVTMLMLGFPLLMYY 
MFIGATLYDGHLPLPDENQSITDFLSHLVDLAHT 
HAYPNRKAWIIYWTFLVLEGLGYL 
YLPGVYGKGKCLPHLNGKQLDYYCSAASSWYVTI 
AAALVLHFSGIFRLSTLVEEFGPL 
MSVAICSGFLVSIIAYVSALLRGAQHRMTGSHVY 
DFFMGAELNPRLFQWLDMKMFFEV 
RIPWYILFLLTLGTALKQWEDYGFVSGEVSFLLL
AHFLYANACAKGEELIITSWDMYY
EKWGFMLIFWNLAGVPMSYCHCTLYLASHDPSTY 
KWNPIALGVLFVMYIFAYWVWDTC 
NSQKNLFRAQERGKPVNRKTFPQLPWKSVKNPVC 
IKTKTGDSILCSGWYGMARKVHYS 
CDWFFAFSWGLITGFNSPFPWFYSCFFTVMIIHR
ALRDIERCRERYGEAWREYERRVP 
YLFIPVSRLELVL" 
/locus-tag="An14g05780"
/number=1

/locus-tag="An14g05780"
/number=1

/locus-tag="An14g05780"
/number=2

/locus-tag="An14g05780"
/number=2

/locus-tag="An14g05780"
/number=3
>15 /locus-tag="Anl4g05790" 

1141 /locus-tag="Anl4g05790" 



151899. 
152378 . 
152720 . 
153190. 
153395. 

complement ( join (151141 . /locus-tag="Anl4g05790" 
.151564, 151613. .151848, 

151899. .152339, 
152378. .152673, 
152720 . . 153109, 
153190. .153256, 
153395. .153445) ) 

/inference="profile:PFAM:PF04082"
/note="unnamed protein product; 
Similarity: the predicted protein 
shows similarity to several 
putative and described fungal 
transcription factors; their 



complement ( 151141 , 
64) 



complement ( 151565 , 
12) 



complement ( 151613 . 
48) 



complement ( 151849 . 
98) 



complement ( 151899 . 
39) 



complement (152340 , 
77) 



complement (152378 , 
73) 



complement ( 152674 . 
19) 



complement (152720 . 
09) 



common feature is the presence of 
a typical binuclear cluster 
zinc-finger, responsible for DNA 
binding, which is not well 
conserved in the predicted 
protein. Title: weak similarity to 
transcription activator prnA - 
Aspergillus nidulans" 
/citation=[51] 
/codon-start=1
/protein-id="CAK42140.1"
/db-xref="GI:134081885"
/translation="MAARTPSIARNSFRARAIKC 
DKWPCSSCSALGIACGPTGLTEV 
SEQRFSAIQEQLATIQQALHQITKPSTSVQPDSH 
PIATGPTAPPFEGQSSFHHETLVA 
RDAAYSAANTARDGQVGDYVSAALSSLKSSLDRH 
NHAMNRSGAQSPAQLNQTLLPVDL 
WAWKKVKSQPPFFLVSQSWLDSTRVEHLCQSI 
YFPLDPVPPGSLTLFYGILFYVIR 
DYLHAGDPDLAHFDLQLSLELCERHFVAGLSKPE 
IMFEPTLEKIQALLLGIQVLKAQE 
ETDVQRCWTFLSLAFNMCQSIGLHRRSTLEKDEF 
AIAEAKRHAFWALYTIDKNISLNI 
GVTSHFQDHDIDADLYAPSNDPKYRPWDLMGLVI 
VEFANVQGRVYDQLYSTSACRADD
EQRSASIERLSSDLMTVRDKLLAIDVSRGLYADS 
LHGMAACADFIAYSVLTVIYRAQT 
HPRDVMAVSSQCYASATAALQSHLKCFTYFRGRQ 
THKQTEYVNWILLYPSFTPFVIVF 
THAITTASTADLALLQDTASSLELIKGLSRGSMH 
LYTICDAFVRAAQILVNSQQTLTG 
LEQHQDGSLVIPTTDGPGNIALPDVPWPEDTFDS 
TMNQADISMFLNGFIGTSRSVMDI 
LSFDPNSDSI" 
.1515 /locus-tag="Anl4g05790" 

/number=l 
.1516 /locus-tag="Anl4g05790" 

/number=l 
.1518 /locus-tag="Anl4g05790" 

/number=2 
.1518 /locus-tag="Anl4g05790" 

/number=2 
.1523 /locus-tag="Anl4g05790" 

/number=3 
.1523 /locus-tag="Anl4g05790" 

/number=3 
.1526 /locus-tag="Anl4g05790" 

/number=4 
.1527 /locus-tag="Anl4g05790" 

/number=4 
.1531 /locus-tag="Anl4g05790" 



gene 
mRNA 
CDS 



complement ( 153110 . 
89) 

complement ( 153190 . 
56) 

complement ( 153257 . 
94) 

complement ( 153395 . 
45) 

<154037. .>156562 
<154037. .>156562 
154037. .156562 



.1531 /locus-tag="Anl4g05790" 

/ number=5 
.1532 /locus-tag="Anl4g05790" 

/number=6 
.1533 /locus-tag="Anl4g05790" 

/number=6 
.1534 /locus-tag="Anl4g05790" 

/number=7 

/gene="aguA" 

/locus-tag="An14g05800"

/gene="aguA"

/locus-tag="An14g05800"

/gene="aguA"

/locus-tag="An14g05800"

/EC-number="3.2.1.131"

/inference="profile:COGS:COG3661"

/inference="profile:PFAM:PF03648"

/inference="similar to AA

sequence:UniProtKB:ANI290451.1"

/note="Function: hydrolysis of

alpha-D-1,2-(4-O-methyl)glucuronos

yl links in the main chain of

hardwood xylans. Gene-ID: aguA"

/citation=[58]

/codon-start=1

/product="alpha-glucuronidase

aguA-Aspergillus niger"

/protein-id="CAK42141.1"

/db-xref="GI:134081886"

/db-xref="GOA:A2R3X3" 

/translation="MRGLNLFQLILALLLSMVAA 

EDGYDGWLRYAPVSCDLHCQQALP 

SHWLLNSTKGSPIETAGRELKAGFQSILSTNLT 

SRPFQCNSSASILVATLDEYRQRC 

RDINVPELDPDGFWLQSEGDTVRILGKDARGALY 

GAYEYLAMVAQRNFSRVAYATSPH 

APIRWVNQWDNMDGSIERGYGGASIFFKDGTWE

DMAPVEQYARLLASIRINAIWNN

VNANATLLLPENMKGLSRIADACRPYGVQIGISL 

NFASPEALGGLNTYDPLDPGVIAW 

WQNITDSLYTYVPDMAGYLVKADSEGQPGPDTYN 

RTLSQGANLFARALQPYGGVLMYR 

AFVYDDNLNESDWKADRAKAAVEYFKDLDGQFEE 

NWIQIKYGPIDFQVREPTSPLFA 

NLYHTNTAIELEVSQEYLGQQCHLVYLPPLWKTV 

LDFDLRVDHKPSMVRDIISGQRFN 

RTLGGWAAWNVGTNRTWLGSHLAMSNLYAYGRL 

AWSPTDESEQILEDWTRLTFGQNH 

HVINTISDMSMTSWPAYENYTGNLGIQTLTDILY 

THYGPNPATQDNNGWGQWTRADHD 

SVGMDRTIWNGTGYTGQYPEEVARVYESLESTPD 

DLVLWFHHVPWTHRLHSGVTVIQH 

FYNAHYAGAEAAHGFVRQWESLEGLIDRERYEAM 

RSRLVYQAGHSIVWRDAINNFYYN

MTGIPDVAGRVGHHPWRIEAESMGLDGYQTYTVS

PFEAASNTTAIITTSNSTTGTART

SIKAPSGVYDIGVNYYDLYGGQSKWTLSVGDKW 

GQWLGDMEHNSLGHTPSIYLGGHS 



sig-peptide 



mat-peptide



154037. .154096 



154097. .156559 



154037. .156562 



<157327 

join(<l 
157564. 
join(15 
15756 



>158616 
57327. .157515, 

158616) 
7327 . . 157515, 
158616) 



ATRITFHGVGIRQGDQLKIVGEANGVEPAPLDYI 

VLLPPGLVD" 

/gene="aguA"

/locus-tag="An14g05800"

/inference="protein

motif:SignalP:2.0"

/gene="aguA"

/locus-tag="An14g05800"

/product="alpha-glucuronidase

aguA-Aspergillus niger"

/gene="aguA"

/locus-tag="An14g05800"

/number=1

/locus-tag="An14g05810"
/locus-tag="An14g05810"

/locus-tag="An14g05810"

/EC-number="2.3.1.48"
/inference="profile:COGS:COG5076"
/inference="profile:PFAM:PF00439"
/inference="similar to AA
sequence:PIR:S28051"
/note="unnamed protein product;
Complex: GCN5 of S. cerevisiae is
a component of the SAGA complex,
which possesses histone acetylation
function and is important for
transcription in vivo. Function:
GCN5 of S. cerevisiae and many
other organisms acetylates
histones H3 and H4 non-randomly at
specific lysines, causing
chromatin remodelling during
transcriptional activation.
Similarity: in the C-terminal part
of the protein there is a
bromodomain, whose function may be
to tether type A histone
acetyltransferase to the chromatin
during gene activation. Title:
strong similarity to histone
acetyltransferase Gcn5 -
Saccharomyces cerevisiae nucleus"
/citation=[21]
/citation=[23]
/citation=[29]
/citation=[31]
/citation=[34]
/codon-start=1
/protein-id="CAK42142.1"
/db-xref="GI:134081887"
/db-xref="GOA:A2R3X4"
/translation="MDGAELQSPLLDNGQPPLVA 
KRVTSEEPNAAADAKRLKTSDDRE 
APPQTNLPAATRIVPFPEKPAWEERNCEIEFRV 
VNNDGSTESTVILTGLKNLFQKQL 
PKMTKDYIARLVYDHTHLSLAICKMPLEIIGGIT 
FREVRHRRFAEIVFCAASSDQQVK 
GYGAHLMAHLKDYVRATSPVMHFLTYADNYATGY 
FQKQGFTKEITLDKSIWMGYIKDY 
EGGTLMQCSMLPRIRYLEVGRMLLKQKACVLAKL 



exon 

intron 

exon 



157327. .157515 



157564. .158616 

<159715. .>163074 
join(<159715. .160332, 
160378. .160543, 
160722, 
160893, 
162768, 
>163074) 
join(159715. .160332, 
160378. .160543, 
160722, 
160893, 
162768, 
163074) 



160593 
160785 
160962 
162857 



160593 
160785 
160962 
162857 



RPLSRNHIVHPPPPQWANGIVTLI 

DPLSIPAIRATGWSPDMDELARQPRHGPHFNELR 

RFLSKIQAHKQAWPFLSPVNKDEV

PDYYNFIESPMDLSTMEERLENDTYSTPKELIND 

LKSIFRNCRQYNDATTIYAKSATK 

LEKYIYSLIKEVPEWFDLVKE" 

/locus-tag="An14g05810"

/number=1

/locus-tag="An14g05810"
/number=1

/locus-tag="An14g05810"
/number=2

/locus-tag="An14g05820"
/locus-tag="An14g05820"



/locus-tag="Anl4 



/EC-number="3.2.1.23"
/inference="profile:COGS:COG1874"
/inference="profile:PFAM:PF01301"
/inference="similar to AA
sequence:UniProtKB:A00968.1"
/note="unnamed protein product;
Catalytic activity: hydrolysis of
terminal, non-reducing
beta-d-galactose residues in
beta-d-galactosides. Function:
cleaves beta-linked terminal
galactosyl residues from
gangliosides, glycoproteins, and
glycosaminoglycans. Remark:
beta-galactosidase of A. niger has
been already patented under
patent number WO9010703 and
WO9716555-A1. Similarity: belongs
to family 35 of glycosyl
hydrolases. Similarity: the
absence of nice BLASTN
alignments demonstrates that the
predicted ORF is just a homologue
of the already described
beta-galactosidase of A. niger.
Title: strong similarity to
beta-galactosidase lacA -
Aspergillus niger

extracellular/secretion proteins"

/citation=[9]

/codon-start=1

/protein-id="CAK42143.1"

/db-xref="GI:134081888"

/db-xref="GOA:A2R3X5"

/translation="MKTSFLLAIGLAVEACLGLV 

SAPNYVRQINATDSSLQDIVTWDE 

YSIRVRGERILLLLGEFHPFRLPCPGLWLDVFQK 



VRALGFSAVSFYVDWALLEGERGS 

IRADGVFALEEFFQAATEAGLYLTARPGPYINAE 

VSGGGFPGWLKRVQGRLKTTDQGY 

LDAITPYMQAIGRIIAKAQITNGGPVILFQPENE

YTACVQDEGYTQKEY^4AYVEEQYR 

KAGIWPFIVNDADPMGNFAPGTGVGAVDIYSFD 

DYPLQWSTAPSNPSNWSSLISPLL 

SYNETVHEEQSPTTPFSISEFQGGVPDAWGGVGI 

ETSAAYIGPEFERIFYKINYGFRA 

AIQNLYMIFGGTNWGNLGHSGGYTSYDVGAAIAE 

DRQVIREKYSELKLQSNFLQASSA 

YLETHSDNGSYGIYTDATSLAVTRLAGNPTNFYV 

VRHGELTSRESTSYKLRVNTSAGN

LAIPQLSGSLSLHGRDSKIHLVDYNVGNVSLIYS 

TAELFTWKQAGSKSVWLYGGEDE 

LHEFAVPANKGKPTSIEGDGLQVQQINSTTVIQW 

AVQPSRRWHFSDTLEVHLLWRNE

AYNYWVLDLPVPGAIGRHVSRSHTNRSVIVKAGY 

LLRTAEIIGTSLYLTGDINTTTTI 

ELISAPQPVTSILFNKNRIPTTITSPGRLTGTLT 

YHKPNISLPDLTTLDWYYLNTLPE 

VHDPTYDDHLWTPCTHTTTANPRNLTTPTSLYAS 

DYGYNGGTLLYRGTFTATGNETSL 

YLLTEGGYAYGHSIWLNNTFLASWPGNPAFLLSN

QTITFPSPLTPGTTYKLTILIDHL 

GNDENFPANGEFMKDPRGILDYTLHGRDDKSAIS 

KKMTGNFGGESYADLSRGPLNEGA 

LFAERKGYHLPGAPTEQWTKRSPFDGLPEDERPG 

VGFFATKFDLQIPDGYDVPISWF 

ENSTMAGDGSGPARFRSELFVNGWQFGKYVNHIG 

PQLSYPVPEGILNYNGSNYLALTI 

WAMDEKSFKLDGLRLQANAWQSGYRKPSLVKGE 

VYKERVDSY" 



sig-peptide 159715..159768 /locus-tag="An14g05820"
/inference="protein
motif:SignalP:2.0"

mat-peptide join(159769..160332, /locus-tag="An14g05820"
160378..160543,
160593..160722,
160785..160893,
160962..162768,
162857..163071)
/product="unnamed"

exon 159715..160332 /locus-tag="An14g05820"
/number=1

intron 160333..160377 /locus-tag="An14g05820"
/number=1

exon 160378..160543 /locus-tag="An14g05820"
/number=2

intron 160544..160592 /locus-tag="An14g05820"
/number=2

exon 160593..160722 /locus-tag="An14g05820"
/number=3

intron 160723..160784 /locus-tag="An14g05820"
/number=3

exon 160785..160893 /locus-tag="An14g05820"
/number=4

intron 160894..160961 /locus-tag="An14g05820"
/number=4

exon 160962..162768 /locus-tag="An14g05820"
/number=5

intron 162769..162856 /locus-tag="An14g05820"
/number=5

exon 162857..163074 /locus-tag="An14g05820"
/number=6

gene complement(<163244..>165037) /locus-tag="An14g05830"

mRNA complement(<163244..>165037) /locus-tag="An14g05830"

CDS complement(163244..165037) /locus-tag="An14g05830"

/note="unnamed protein product;
Function: co-expression of het-e
and het-c leads to cell death.
Function: het-e1 of P. anserina is
responsible for vegetative
incompatibility. Remark: het-e1 of
P. anserina shows also two
sequence motifs, a GTP-binding
domain and a repeated region that
shares similarity with that of the
beta-transducin. Remark: the
reactivity of the HET-E protein
depends on two functional
elements, a GTP-binding domain and
several WD40 repeats. Similarity:
the similarity to het-e1 involves
only the N-terminal half of the
predicted protein. Title:
similarity to beta transducin-like
protein het-e1 - Podospora
anserina"
/citation=[26]
/citation=[44]
/citation=[69]
/citation=[73]
/codon-start=1
/protein-id="CAK42144.1"
/db-xref="GI:134081889"
/translation="MRLINAKTYQLEEYLDSEIP 
EYAILSHTWGKDEISFQDLQALST 
GSRYWDRWGREGFLKIKYVCQQAIRDGLDYAWV 
DTCCINKSSSAEISEAINSMFQWY 
KRSRKCYVYLSDIPSGYGEVPLDEQAGSSVEPAR 
RRRPSKYDGTINGPFARCRWFTRG 
WTLQELIAPKDLTFYGSGWNVLGTKASMTALIST 
ITGIDQRVLRDSTDTKESLRSVCV 
AQKMSWAARRQTSKIEDEAYCLLGLFDVHMPILY 
GEGKASFVRLQEEIIKSSDDHTIF 
AWNYCDIEQESRYTVLAPSSTCFLKNWQWTCPR 
NSGPYTMTNRGLHIQLPVIYRSGS 
KDSCLALLDCHYEDDVTGALALPLTPRSDNTFDI 
DYDISRGRWRVDLGLETRATTRD 
LYIRSQPNVPDRGPQLTCQFNIDQDESRAATLSI 
TEYYPSRIWDAENHRLRLSSNDST 
SLETWAAVRLHNSSGRAIGWFHISPHRRPESTS 
TFNVHGWICMQTMKPDMTLSQIVD
QVIRGCASASAYEYGNHGSCTLALSRRHHMVATN
TFVSGNGSETLINVTVANRYAPCT
LRRVFGATPKPPKNAETPSKEKDLRPSMSTD" 
exon complement(163244..165037) /locus-tag="An14g05830"

/number=1

gene complement(<166037..>167516) /locus-tag="An14g05840"

mRNA complement(join(<166037..166398, /locus-tag="An14g05840"
166461..166917,
166970..167082,
167142..167268,
167319..>167516))
CDS complement(join(166037..166398, /locus-tag="An14g05840"
166461..166917,
166970..167082,
167142..167268,
167319..167516))

/EC-number="2.1.1.109"
/inference="profile:COGS:COG0500"
/inference="profile:PFAM:PF00891"
/inference="similar to AA
sequence:UniProtKB:AF159789.1"
/note="unnamed protein product;
Catalytic activity: A. flavus omtB
converts demethylsterigmatocystin
(DMST) to sterigmatocystin (ST)
and
dihydrodemethylsterigmatocystin
(DHDMST) to
dihydrosterigmatocystin (DHST).
Function: A. flavus omtB is a
demethylsterigmatocystin
6-O-methyltransferase involved in
aflatoxin biosynthesis. Remark:
aflatoxins are polyketide-derived
secondary metabolites. Title:
strong similarity to
O-methyltransferase omtB -
Aspergillus flavus"
/citation=[70]
/codon-start=1
/protein-id="CAK42145.1"
/db-xref="GI:134081890"
/db-xref="GOA:A2R3X7"
/translation="MFTNSSHANGASQKPVDGLI 
RQVAQAADSFHNSHDEADRLAALQ 
AAQKLVEAFQKPQDAVYNLAYSPTHVMCVRIAMD 
LGIFSTLSERNGPVTLEDLAATKG 
ADLVLTERVLRILAGIGYVAEHDVRVYIPTTMTR 
QMTDRLSGAVNDSFDIGMPTLAKV
PEFLRMTNFQNPSGAVKGALQYAEKTEMSLWDWV 
PTKPGFLDACNTFMEADRGSRPSW 
LEWFPVKERIIDGFHQGESNVLLVDVAGGRGHDL 
VAFESKFSEVQGRLILEDLPHWA 
EATQHPKIEHVSFDLFQAQPIQGARTYYMKFILH 
DWSDEESRQILSHLAAAMKMGYSK 
LIIEEFVLADKDCAMLPAMWDWEMLIFCNSMERT 
ASQWTKVLDSAGFRWKFWAPPGD 
GQSIIEAELKEPAEPESSRLALSSHI" 
exon complement(166037..166398) /locus-tag="An14g05840"

/number=1

intron complement(166399..166460) /locus-tag="An14g05840"

/number=1

exon complement(166461..166917) /locus-tag="An14g05840"



complement(166918..166969)

complement(166970..167082)

complement(167083..167141)

complement(167142..167268)

complement(167269..167318)

complement(167319..167516)

<168480..>170002
join(<168480..169074,
169128..169245,
169293..169453,
169507..169540,
169597..>170002)
join(168480..169074,
169128..169245,
169293..169453,
169507..169540,
169597..170002)



/number=2

/locus-tag="An14g05840"

/number=2

/locus-tag="An14g05840"
/number=3

/locus-tag="An14g05840"
/number=3

/locus-tag="An14g05840"
/number=4

/locus-tag="An14g05840"
/number=4

/locus-tag="An14g05840"

/number=5

/locus-tag="An14g05850"
/locus-tag="An14g05850"



/locus-tag="Anl4 



/note="unnamed protein product;
Function: PTH11 of M. grisea is
required for appressorium
differentiation in response to
inductive surface cues. Function:
as do most fungal plant pathogens,
M. grisea differentiates an
infection structure specialized
for host penetration called the
appressorium. Phenotype: M. grisea
cells null mutant for PTH11 are no
longer pathogenic. Similarity: the
similarity to PTH11 of M. grisea
is limited to the N-terminal half
of the predicted ORF. Title:
similarity to integral membrane
protein PTH11 - Magnaporthe
grisea"
/citation=[63]
/codon-start=1
/protein-id="CAK42146.1"
/db-xref="GI:134081891"
/translation="MPADKGPQITAVSWSFGGVA 
IIWTIRLYTRLILTRKAGWDDFF 
IVLSLVSYPCKSLLYNTSANPDGAWMQLSAIVCS 
GLAQTGVHYGLGKHMADISNAEWK 
IEAFKYTVIAPNFSMVSTTTGKLSVAVFLLRLMG
QTASPAKRWFLYIFSIISVAWNVL
AIVAIMGYCRPAEKIWRPEVPGSCFSLKFQLIAG
ISQASFNAFADLTLALFPIIIFWS
VQLPWKMKLGVIAVMGAGILAAAATLVKAILLKS
LPAHSDITCEHQIRGRAQTHKKKL 



exon 168480..169074

intron 169075..169127

exon 169128..169245

intron 169246..169292

exon 169293..169453

intron 169454..169506

exon 169507..169540

intron 169541..169596

exon 169597..170002

gene <170380..>172035
mRNA <170380..>172035
CDS 170380..172035



170380..172035

<172769..>173936
join(<172769..172958,
173013..173080,
173134..173513,
173575..173639,
173698..>173936)
join(172769..172958,
173013..173080,
173134..173513,



NIELQGLGPILQHGTPNTILSAPLESSGAHPVHR 

QMYVIIICATLPTLRQSYNFALHR

TRYLGSYYKGSHSEAAAAAAARQKPIPLLRRQPD 

QSLFETFADETGLVNVASHEDSHV 

SPERGSNHTSITKTTEIEVHEESKSKGDIENPHF 

PRANPFRTDHQ" 

/locus-tag="An14g05850"

/number=1

/locus-tag="An14g05850"
/number=1

/locus-tag="An14g05850"
/number=2

/locus-tag="An14g05850"
/number=2

/locus-tag="An14g05850"
/number=3

/locus-tag="An14g05850"
/number=3

/locus-tag="An14g05850"
/number=4

/locus-tag="An14g05850"
/number=4

/locus-tag="An14g05850"
/number=5

/locus-tag="An14g05860"
/locus-tag="An14g05860"
/locus-tag="An14g05860"
/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42147.1"

/db-xref="GI:134081892"

/translation="MACPPTATCPIISSSLVTKS 

AFKSKETRENGSESESERNSQHIA 

KSTGNANAELQQTGKVKELFQDYGHGRSPESITV 

GGAYAPMSGYFLPSNLLEDQNGPR 

TPKIIAEGLAVPAGKVKGDDSEPVSSAMVTSRMK 

LNQVLGNRRKVGCAASLPAGTAAH 

PVEHGPLAGRSSGVGVSFGAEGDPAFEGSYEDDY 

DDACANHAASDGDREDKEALKNGG 

KTDNTGESEYPRFSQVDYFTQHQSLQIRTTGLPK 

RPRYSAATVGFPCPVTVWAFIDRG

DPKHGMPTAADLPWEIRTLHETNELNTIAAYLAS 

MRRNLKDLDVNFTFHYRPVSFIDK 

ADRVKKIGGYSSSDKQNAHTVADLYMHTKDDRSW 

GTLCVTREHHVQGMAWHYWAVAAI 

SPPKSPFQSVDEKGVYLLMYDSRPLKEPTKKRDR 

DYWRNYMRRDQYKLLVEIGKNFKV 

LDLAINRRQLSVEGEDDPLRLTLWWLWNIARYGG 

GFYERDGELDDPRWRLTDARWLHF 

DQDWSSNVRTALRWKNVKEWFSEQRLDEQLTHEQ 

RIQAHAVAA" 

/locus-tag="An14g05860"
/number=1

/locus-tag="An14g05870"
/locus-tag="An14g05870"



/locus-tag="An14g05870"



173575..173639,
173698..173936)



exon 172769..172958

intron 172959..173012

exon 173013..173080

intron 173081..173133

exon 173134..173513

intron 173514..173574

exon 173575..173639

intron 173640..173697

exon 173698..173936

gene <175066..>176514

mRNA join(<175066..175356,
175420..175941,
176025..176078,
176188..>176514)
CDS join(175066..175356,
175420..175941,
176025..176078,
176188..176514)



/note="unnamed protein product;
Function: PTH11 of M. grisea is
required for appressorium
differentiation in response to
inductive surface cues. Function:
as do most fungal plant pathogens,
M. grisea differentiates an
infection structure specialized
for host penetration called the
appressorium. Phenotype: M. grisea
cells null mutant for PTH11 are no
longer pathogenic. Similarity: the
length of the predicted ORF is
only one half of the M. grisea
PTH11. Title: similarity to
integral membrane protein PTH11 -
Magnaporthe grisea"
/citation=[63]
/codon-start=1
/protein-id="CAK42148.1"
/db-xref="GI:134081893"
/translation="MTPLNHWGITLVWPIVACV 
VTTLFFILRLYSRRLVRQKLDLGD 
VLMGIGLFFSYGLTLCTVISAFNGVGHEFWSLSR 
PTRGTVTLLFWCSTKFWVLSHVFV 
KFSYIVLLRKLFGAITYWRRLTTALIVFTLAWGI 
ASIFVSIFQCWPVRYFWIKHIDGS
CMQGRNTFYIWGSIALAENFILVSMPLVWWGM
NLSPRQKVELSLIFGFGGLVCAIG
LLRMVTFKRYVTADATTNGYLQAIWSIVELELGI
ICASVILMRPIFQPPLGIIQNEML
HHAWPNIKVQLLFKRRADDEFWMAEAAGLCRDG 
ISG" 

/locus-tag="An14g05870"
/number=1

/locus-tag="An14g05870"
/number=1

/locus-tag="An14g05870"
/number=2

/locus-tag="An14g05870"

/number=2

/locus-tag="An14g05870"
/number=3

/locus-tag="An14g05870"
/number=3

/locus-tag="An14g05870"
/number=4

/locus-tag="An14g05870"
/number=4

/locus-tag="An14g05870"
/number=5

/locus-tag="An14g05880"
/locus-tag="An14g05880"



/locus-tag="An14g05880"



exon 175066..175356

intron 175357..175419

exon 175420..175941

intron 175942..176024

exon 176025..176078

intron 176079..176187

exon 176188..176514

<176803..>178484
join(<176803..177362,
177419..>178484)
join(176803..177362,
177419..178484)



/inference="similar to AA
sequence:UniProtKB:NCB912.2"
/note="unnamed protein product;
Similarity: the central part of
the predicted ORF shows only weak
similarity to N. crassa B9I2.20.
Title: strong similarity to
hypothetical conserved protein
B9I2.20 - Neurospora crassa"
/codon-start=1
/protein-id="CAK42149.1"
/db-xref="GI:134081894"
/translation="MPLASDAFQADLSRLSPSHL 
EVSLPVTLSLAVLAYLLLVSTLRF 
QRVRILYRDYPQYTTRASMSQMTVDDAWAIQKNI 
LQLEFPTTAVKALQFALFRTYGIP 
TISSLLLHTSQFSNPATSFKRYADTGALIGQFVT 
CPPTSSRARTAIARTKFLHSGYRA 
SGRILESDMLYTLSLFATEPIRFVERFEWRAMTE 
LERCAIGTYWKSLGDALDISYDEL 
PSGKSGFSDGLHFLEELRQWGDHYEEENMRPDHR 
NRLVADKTMDVIVYGFPKWHILCD 
GFCKEPCADSLNQEVFSDTPNEDGRYYVQIWKGM
PYYVQPTMWNRWGPAAWLTWALGL 
PLPGDDGDTYYPRGFDVADLGPRQFEGRGRKSVG 
EYVDVLEKEHKGQCPFGGRSDQPEICLSR"
/locus-tag= 
/ number= 
/locus-tag= 
/number=l 

/locus-tag="An14g05880"
/number=2

/locus-tag="An14g05880"
/number=2

/locus-tag="An14g05880"
/number=3
/locus-tag= 
/ number= 
/locus-tag= 
/number=4
/locus-tag="An14g05890"
/locus-tag="An14g05890"

/locus-tag="An14g05890"

/EC-number="5.2.-.-"
/inference="profile:COGS:COG4716"
/note="unnamed protein product;
Remark: CLA compounds
(cis,trans)-9,11-linoleic acid and
(trans,cis)-10,12-linoleic acid
are recognised nutritional 
supplements and effective 
inhibitors of epidermal 
carcinogenesis and forestomach 
neoplasia; furthermore CLA has 
also been shown to prevent adverse 
effects caused by immune 
stimulation in chicks, mice and 
rats, and has been shown to 
decrease the ratio of low density 
lipoprotein cholesterol to high 



^"Anl4g05880" 
^"Anl4g05880" 



^"Anl4g05880" 

"Anl4g05880" 



exon 

intron 

exon 

gene 

mRNA 

CDS 



176803. .177 
177363. .177 
177419 . .178 



density lipoprotein cholesterol.

Remark: the linoleate isomerase is 

used in a method for producing 

conjugated linoleic/linolenic acid 

(CLA) from oils such as sunflower 

oil, safflower oil, corn 

oil, linseed oil, etc. Similarity: 

the N-terminal part of the 

predicted ORF shows also weak

similarity to some

oxidoreductases. Similarity: the

patented protein, as well as the 

predicted ORF, show strong 

similarity to the 67 kDa 

myosin-crossreactive streptococcal 

antigen of Streptococcus pyogenes, 

a protein with unknown function, 

but involved in the pathogenesis 

of streptococcal infections. 

Title: strong similarity to 

linoleate isomerase protein 

sequence PCLA591 from patent 

WO9932604-A1 - Lactobacillus

reuteri"

/citation=[19]

/codon-start=1

/protein-id="CAK42150.1"

/db-xref="GI:134081895"

/db-xref="GOA:A2R3Y2" 

/translation="MSSISAPPTPPPERRLPEGM 

DAWILGAGISSLTAAVHLIQEAHV 

PPSRVHILETLSQAGGGSISTGDPVNGYDCRAGG 

MPPFNDVCMEELLSLVPSKTNPNL 

TVLEEFHEFWDTEAVKDHPHTRFLTRHKHGLERI 

DAKRASLGLRDRVDLFMLASKSDK 

SLGRSRICDHFSNSFFKSYYWMLLSTTFGIKPVH 

SAAEFRRYLQHYMHDIHEIHCRRK 

LDGGRYNRHESIVSPIAHFLCSRGVDFRFHTTVT 

DIITTPSSSEPHRVSAIKAIHENE 

PEMTINLGERDIVLVSLGSVMSGSTTGTNTSPPS 

LELMDIEKDLDENWLLWLELSTKN 

PIFGNAYNFCTRMAESRLESFTVTFSSPEFFNRF 

TALTGDKVGSGTFVTLKDTPWLLS 

INLPQQPLFPDQPAHVQVLWGYAMYPEREGDYIK 

KPMLECSGQEIMEEILKQLNFPVQ

GILDHSITLPCWPRAAATLLVRLRSDRPPVIPP

GIDNLGLIGQFVDIPGEVAVTMDY

GVRSAQTAVRQLMGLERHKVLSKRSSAINLKAL" 

/locus-tag="An14g05890"

/number=1

/locus-tag="An14g05890"
/number=1

/locus-tag="An14g05890"
/number=2
complement(<179356..>181260) /locus-tag="An14g05900"

complement(join(<179356..179403, /locus-tag="An14g05900"
179482..>181260))
complement(join(179356..179403, /locus-tag="An14g05900"
179482..181260))



/inference="profile:COGS:COG3122"

/inference="similar to AA

sequence:PIR:T49456"

/note="unnamed protein product; 

Similarity: other BLASTP hits are 

explained by the relatively high 

content of low-complexity regions. 

Title: strong similarity to 

hypothetical protein B14D6.80 - 

Neurospora crassa" 

/codon-start=1

/protein-id="CAK42151.1"

/db-xref="GI:134081896"

/translation="MPEVIDLLSSTPPPPLERRE 

QLSIRRAHSHPPWPASDSIFSDE 

IDLSAFTYDNPIEQPAKRRRLSHEITPVKDVPLP

PAKNSLFLFSDDDIILSSDGPAPT 

KPPAWNGESDPIVFTSSAPEPTHNLPVRPAISSR 

AVNTISLDDDDAGNSSDTRSNNRP 

RKEAIQVSSDQLDFPDVDDLLDVAQRVEANAFSS 

RTANLLATLEDRSRGSNGESASRS 

SRGRRIQDQSDDIEVEEMPPPRKPQRKAIKVTSE 

EKEAKAREREAAKAQREHDRQLEK 

ERKQKAKEDKAREKQLAADIAEVNKLKVDKKDST

HEMLIDLASTFTDTSLGNQTSELM 

RLLKVDLSFFPSTIPNIVKWRRKVRATYNDSLGH 

WEPCALHIRDEEEHVLCLIAAQDF 

INMIVAPTTEPRNTINDHVERLKLAYPKSRPIYL

IEGLTSLMRKNNNAQNRAYQAAVR

RQYEDSSARPSTRRKQQQPEPVTPIDSDTVEDAL 

LDLQVTHSCLIHHTSSPAESAEWI 

KNFTEYISTVPYRRERMDLNDSAFCMDVGQVKPG 

ENRSDAFVKMLQEVNRVTASMAYG 

IATQYPSAMDLVNAMRKHGPGLLEDVKQTKTAPS

PIHALVQP" 

complement(179356..179403) /locus-tag="An14g05900"

/number=1

complement(179404..179481) /locus-tag="An14g05900"

/number=1

complement(179482..181260) /locus-tag="An14g05900"

/number=2

complement(<181479..>182990) /locus-tag="An14g05910"

complement(join(<181479..182086, /locus-tag="An14g05910"
182177..182601,
182623..>182990))

complement(join(181479..182086, /locus-tag="An14g05910"
182177..182601,
182623..182990))

/EC-number="2.4.1.-"
/inference="profile:COGS:COG0438"
/inference="profile:PFAM:PF00534"
/inference="similar to AA
sequence:PIR:S64069"
/note="unnamed protein product; 
Function: ALG2 of S. cerevisiae is 
involved in N-glycosylation, 
converting 



gene 
mRNA 



man [2]glcnac-pp-dolichol to 
man [3]glcnac-pp-dolichol. Title: 
strong similarity to 
mannosyltransferase Alg2 -
Saccharomyces cerevisiae"
/citation=[3]
/citation=[15]
/codon-start=1
/protein-id="CAK42152.1"
/db-xref="GI:134081897"
/db-xref="GOA:A2R3Y4"
/translation="MPPAKITIIHPDLGIGGAER 
LIIDVALALQSRGHPVTIYTSHRD 
KSHCFEEARDGTLDVQVRGNTIFPAHVGGRLFVL 
MAILRQLHLTWDLLCGETTSGDNG 
EGEEEVFIVDQMPACVPFLKVRKQRILFYCHFPD 
QLLARRDEGGSVLQLLKGLYRVPF 
DWFEGWAVSASDKWANSRFTRGWSGVFGREKV 
GDLSWYPCVDTKAGDGGEGWKD 
GEKLWGGKKILLSVNRFERKKDLALAIRAYHGLG
EEKRKGTRLWAGGYDNRVQENVQ
YHRELDELATGLGLQTATSKTVISALSIPDSIDV
LFLLSVPTAFRDTLLLQAKLLLYT 
PINEHFGIVPVEAMRAGVPVLASNTGGPLETIVE 
GETGWLRDAKVDADWTAVMDKVLY 
GMKQEELDRMSVAAKERVEKEFSLTAMGEKLEQE 
IEEMLGQEQRPFHGFQQLLTFLAL
VGWLALAAAFLVKLL"
complement(181479..182086) /locus-tag="An14g05910"

complement(182087..182176)

complement(182177..182601)

complement(182602..182622)

complement(182623..182990)

complement(<183872..>189440)

complement(join(<183872..184411,
184472..188524,
188581..188756,
188813..189136,
189281..>189440))
complement(join(183872..184411,
184472..188524,
188581..188756,
188813..189136,
189281..189440))



/number=1

/locus-tag="An14g05910"



/number=1

/locus-tag="An14g05910"



/number=2

/locus-tag="An14g05910"



/number=2

/locus-tag="An14g05910"



/number=3

/locus-tag="An14g05920"



/locus-tag="Anl4 



/locus-tag="Anl4g05920" 



/inference="profile:COGS:COG5059"
/inference="profile:PFAM:PF00225"
/note="unnamed protein product; 
Function: kinesins are 
microtubule-dependent motor 
proteins, involved in organelle 



transport, in mitosis and meiosis, 

and in the transport of synaptic

vesicles along axons of animal 

neurons. Similarity: the closest 

homologues, comprising XCENP-E of 

X. laevis, are involved in driving 

congression of chromosomes to the 

metaphase plate. Similarity: the 

main feature of the predicted ORF 

is to contain, in the N-terminal 

half, several kinesin-like 

domains, explaining the strong 

similarity to several different 

kinesin-like proteins. Similarity: 

the predicted ORF has just half of 

the length of XCENP-E of X. 

laevis; the best homologue, having 

also a corresponding length, is a 

N. crassa hypothetical protein. 

Title: strong similarity to 

kinesin-related protein XCENP-E - 

Xenopus laevis" 

/citation=[41] 

/codon-start=1

/protein-id="CAK42153.1"

/db-xref="GI:134081898"

/translation="MTGSQPESPADAPQRPFSGI 

MRTPRSTSRLSMSSRHGGGSRASD 

EDGKTAVKVAVRVRPPLKPTDPGYELVPQRFQRP 

MVHVTNPTSVAIDVPQGRKLFVFD

RVFAETVDQDGVWDYLSDSVSSFLQGYNVSILAY 

GQSGAGKSYTMGTSGPSEQSDPRS 

MGIIPRAAQLLFEKLEGPAKPNRNSGTGLRTPSR 

YSVGSASSFGKASVEKNWQLKATY 

VEIYNEQLRDLLVPESTHQGDRGTVTIREDAKGR 

IILTGLHQVNINSYEDLMGALSFG 

SSIRQTDSTAINAKSSRSHAVFSLNLVQRKASNG

VMSPTPKDKRMSMPVDMMSGSDAS

VMVDSKLHFVDLAGSERLKNTGASGERAKEGISI

NAGLAALGKVISQLSSRQAGAHVS

YRDSKLTRLLQDSLGGNAYTYMIACVNPAEFHLS 

ETVNTVQYAQRARAIQSKPRIQQI 

ADESDKHAVIERLKAEVAFLRQQLRNAEENGRRS 

AAPQDRAERQNEREVELQNQLLDT 

QESYNALSQRHAKLISEIARDSEHAGETDPNDW 

SLVGKTSVERLKRSQSFAESIEQV 

VLEYEKTIQSLESSLSNTRSSLSVTESTLLERET 

KCAYVETVNSQLQARIQKLLDRES

STETYLHELEARLDGQSTGEEKQAAIVAELRKEL 

SRARESEANCEDYISTLEERLAEA 

DQDMELMQREMERLEHVIERQRSLGKLDNLLYEL 

DHVQQNGNQKEQSEDELETHVPVP

AKGAYKPRTRATSLSLDVLTEAVETAIPESDEGL

TEPAPEAVHEASVEAEATAETDET

NLKVLESATDRLEAQENGARASRASTPTQTKVVA 

DKLETVTQELFDLRMQHESTVSEY 

EMLEAKYAEAMKALAEFQRDAADEARHPDEKVQD 

LLSTNVESRPVSFLEEGKAPGSND 

GKQPSSSPSLSSELSLAGEPASSHEQSTLSNGEV 

PQENHVDTREIDEAKAQEVEQMRR

LLMEHQEGVSIMSQKYAQLQSEHEGTLSLIETLK 

AELQRSKNSSPPSTPGFKSPVIRR 



gene 
mRNA 



complement (183872. .1844 
11) 

complement (184412. .1844 
71) 

complement (1844 72. .1885 
24) 

complement (188525. .1885 
80) 

complement (188581. .18 
56) 

complement (188757. .18 
12) 

complement (188813. .18 
36) 

complement (189137. .18 
80) 

complement (189281. .18 

40) 

complement (<190148..>19 
1691) 

complement (join(<190148 
. .190283, 
190763. .190861, 
190948. .191206, 
191652. .>191691) ) 
complement (join (190148. 
.190283, 190763. .190861, 
190948. .191206, 



KTSQSLIGTVDRAHRSLAALRNIAVEEFEARPDT 

MQNFEVHLDSAMHELHNRMERIQA 

LEAENQSVKKEMETKSTIISGLTRERSSLQGGGG 

SSVDMGLVNQLRDQWQQENLINE 

MKESHDAREKQLLAEIEELKSLLKTQEEAAKAQD 

LCAEEQERKISSLEGEVTELKSKH

HNAVESLQSSEQELSATLAELDKALASIDAMRSE 

QTAAGEASASKDAAARELEAEREQ 

QEELVAKLKHVIDEHKATNAAHLEKIASLEKSHG 

EAQLQLSELLAAKDNDSNEVQVHQ 

SRVSELEKEIDSHKSLADSFKKDLESLQESHKQE 

VTELEARATAAAQGDYESRFATMS 

AEHEEAMKTLRSEILESREELTKLLNMVSKLLNA 

DVTAETMAEQIQEIMAQKQHFSDK

YAEMLDTNEDLRKQLETKGSDEGRL 

EELMQSNSSKEAKVNELALLVATLEDTLLQKEE 

QVKKKEAIIAEVKAEKEKSVRLVEELEEQITNSF 

DQHHNRLSVIQQERDQALEDAKVK

IAAYEKDIETYRVRIEQLEIKNQDSSHDRSSSIT

SNLRKSSSATSLPSPPPAIPLPPL 

PTIASATNGTGSISPPSSRHTSKELVNPQIVDDQ 

EARIRTIEKHLNAEKQLTATLEEA 

LGDLEAQSNKVKSDCDAWKKKARELEEELTTLRK 

ERASQRLSLQAVEEERNARREAEA 

ARAQLEERMNALNKKKKKSTLNCF" 

/locus-tag="An14g05920"

/number=1

/locus-tag="An14g05920"
/number=1

/locus-tag="An14g05920"

/number=2

/locus-tag="An14g05920"
/number=2

/locus-tag="An14g05920"
/number=3

/locus-tag="An14g05920"
/number=3

/locus-tag="An14g05920"
/number=4

/locus-tag="An14g05920"
/number=4

/locus-tag="An14g05920"



/number=5

/locus-tag="An14g05930"



/locus-tag="An14g05930"



/locus-tag="An14



191652. .191691) ) 



gene 
mRNA 



complement (190148. .1902 
83) 

complement (190284. 
62) 

complement (190763 . 
61) 

complement (190862. 
47) 

complement (190948. 
06) 

complement (191207. 
51) 

complement (191652 . 
91) 

complement (<192 418 . .>19 

3985) 

complement ( join (<192418 
. .192525, 
192596 . . 192700, 
192794. .192853, 
192940. .193251, 
193625. .193690, 
193783. .193884, 
193959. .>193985) ) 
complement (join(192418. 
. 192525, 192596 . . 192700, 
192794. .192853, 
192940 . . 193251, 
193625. .193690, 
193783. .193884, 
193959. .193985) ) 



/note="unnamed protein product; 

Similarity: the similarity to NAB2 

of M. musculus is weak, and the 

gene structure of the predicted 

ORF is questionable. Title: weak 

similarity to NGFI-A binding 

protein 2 NAB2 - Mus musculus" 

/citation=[25] 

/citation=[30]

/codon-start=1

/protein-id="CAK42154.1"

/db-xref="GI:134081899"

/translation="MSLSDRAGLGGWADLLLLFN 

QCKEASGGWQKSDAGAGWSSAGE 

SPAAFQSETSKNDKNGVPYPHWGKGGTPDSVSV 

PELQSIRREPWNNFCRQYGNRSSP 

PIASVYDLDKPQIAIQGPSATIDSHQVGIMSAVA 

GRPSFSTSHRPLVSTSVQLNIVKR 

PTWANAPAQPHPGSLQP" 

/locus-tag="An14g05930"

/number=1

/locus-tag="An14g05930"



/number=1
/locus-tag="An14g05930"

/number=2
/locus-tag="An14g05930"

/number=2
/locus-tag="An14g05930"

/number=3
/locus-tag="An14g05930"

/number=3
/locus-tag="An14g05930"

/number=4
/locus-tag="An14g05940"

/locus-tag="An14g05940"



/locus-tag="An14



/note="unnamed protein product; 
Similarity: very weak similarity 
to a fragment of protein SEQ ID 
NO:917 of patent n. WO200042171-A1 



gene 
mRNA 



complement (192418. .1925 
25) 



complement (192526 . . 1925 
95) 



complement (192596. .1927 
00) 



complement (192701. .1927 

93) 



complement (192794. .1928 
53) 



of Eucalyptus grandis. Title: 

questionable ORF" 

/codon-start=1

/protein-id="CAK42155.1"

/db-xref="GI:134081900"

/translation="MVECGMGVRAWAFCDEREI 

SRPGYNTFIIPSFLPMSLNVKIES 

GQNGGTFDDRYSAGDIRADCRNNVLAAGEEKLRY

LQKFMQYLSQRRITFGRISLSPGS 

LAINEPLNPILWVMVLLIDRSVNRLGSRYSSIHF 

SVLDNKAGRLCCYDDIGIMRFYSW 

LQKCITSMEDLVYRIYIVGLVRNLHAMNVMCTLK 

GWALPWIRSPTLNELYAFIASQLE 

GRVMDTRTQLGRWLYKIALCHPSLLSFIFETVSM 

HLWPEVE" 

/locus-tag="An14g05940"
/number=1

/locus-tag="An14g05940"
/number=1

/locus-tag="An14g05940"
/number=2

/locus-tag="An14g05940"

/number=2

/locus-tag="An14g05940"



complement (192854. .1925 
39) 



complement (192940. .1932 
51) 



complement (193252 . . 1936 
24) 



complement ( 193625 . . 1936 
90) 



complement (193691. .1937 
82) 



complement (193783. .1938 



/number=3

/locus-tag="An14g05940"



/number=3

/locus-tag="An14g05940"



/number=4

/locus-tag="An14g05940"



/number=4

/locus-tag="An14g05940"



/number=5

/locus-tag="An14g05940"



/number=5
/locus-tag="An14



complement (193885. .1935 
58) 

complement ( 193959 . . 1939 
85) 

complement (<194969 . .>19 
6478) 

complement (join(<194969 
. .195817, 
195865. .196221, 
196455. .>196478) ) 
complement ( join (194969 . 
.195817, 195865. .196221, 



/number=6

/locus-tag="An14g05940"



/number=6

/locus-tag="An14g05940"



/number=7

/locus-tag="An14g05950"



/locus-tag="An14g05950"



/locus-tag="An14g05950"



196455. .196478) ) 



gene 
mRNA 



complement (194969. .1958 
17) 

complement (195818. .1958 
64) 

complement (195865. .1962 

21) 

complement (196222. .1964 
54) 

complement (196455. .1964 
78) 

complement (<197258 . .>19 

8879) 

complement ( jo 
. .197342, 
197402 . . 197552, 
197606 . . 197698, 
197751 . . 197968, 
198017. .198238, 
198288. .198371, 
198435. .198525, 
198598. .198694, 
198745. .>198879) ) 
complement (join ( 197258 . 
.197342, 197402. .197552, 
197606. .197698, 
197751 . . 197968, 
198017. .198238, 
198288. .198371, 
198435. .198525, 
198598. .198694, 
198745. .198879) ) 



/note="Similarity: a short stretch
of amino acids of the predicted
ORF shows some weak similarity to
an A. thaliana hypothetical
protein."
/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42156.1"

/db-xref="GI:134081901"

/translation="MFGTNRTDTTPSVLYLTESL 

MEIPCLSPAVADDIVPDLAQIFND 

NPFKSLREQKISVPVLCGPHEQGPTDTHIDIFGQ 

EFRTHSSLIKEYTGIWKERQRGEY 

AERRVHYFEARTEKDGGWGIYRKGAVALTPQRYY 

RDYPEIPDERIPSAIESFHCFLQA 

LYLKPVPYVGIKGWRRMVLLGIRFQAVPRVREYI 

EKIILPWTLTSKPLLGLTDDILAY 

TKMAQDIQCAQLYRECLIHLVGMAQYWGRNEFHN 

AEHILTPSAYASLIRHVGTQRQLI 

EKADVALHTFIRTLATQKPRRIPNFLLRQFTDTV 

DNITGLNGRNMNSLGYYQHIRLVL

NDMQLRIPRGSTYNMCHRLWQFIGCLCVNGLLYT

QDQIKGYPVCISPTDIELPWVLAN

APEAPVEDQVRDAIMEG" 

/locus-tag="An14g05950"

/number=1

/locus-tag="An14g05950"
/number=1

/locus-tag="An14g05950"

/locus-tag="An14g05950"
/number=2

/locus-tag="An14g05950"
/number=3

/locus-tag="An14g05960"



(<197258 /locus-tag="Anl4g05960" 



/locus-tag="Anl4g05960" 



/EC-number="3.5.3.11"



/inference="profile:COGS:COG0010"

/inference="profile:PFAM:PF00491"

/note="unnamed protein product; 

Catalytic activity: H2O + agmatine

<=> urea + putrescine. Complex: 

agmatinase of E. coli is a 

homodimer of the speB gene 

product. Function: agmatinase of 

E. coli is part of the second 

putrescine biosynthetic pathway; 

it also represents the only 

pathway for urea biosynthesis in 

E. coli as no urease is present. 

Similarity: the predicted ORF 

shows much stronger similarity to 

putative agmatinases of N. crassa 

and S. pombe. Title: strong 

similarity to agmatinase speB 

- Escherichia coli"

/citation=[4]

/citation=[10]

/codon-start=1

/protein-id="CAK42157.1"

/db-xref="GI:134081902"

/db-xref="GOA:A2R3Y9"

/translation="MYLSWSLLALSGIVTAHSS 

HDQTPLAGPHQRLVvYNTLPGDGGT 

QADSVFSGISTFGRLPYFPCLSSEAERYDIAFIG 

APFDTGTSYRPGARFGPSGIRQGS 

RRLNLYGGYNVPLQANPFVSDLRVLDCGDIPVTS 

YDNAWAIQQIEEGHNSVLMRKPFT 

DAEKYGLSKAGKTLPRIITLGGDHTITLPLLRSI 

NRAYGPVTVIHFDSHLDSWKPKVF 

GGSPSQVAAINHGTYFYHAAMEGLLKNDTNIHAG 

IRTTLSGPSDYENDGYCGFEIVEA 

REIDTIGTDGIIKKIRERVGTENPVYLSIDIDTL 

DPAYAPATGTPETGGWSTRELRTI 

IRGLDGLNFIGADIVEVAPAYDTNAELSTMAAAD 

VLYEVLTIMVKKGPLSVGRSDEL" 



sig-peptide 



complement (198823. .1988 

79) 



/locus-tag="Anl4g05960" 



mat-peptide



complement (join(197261. 
. 197342, 197402 . . 197552, 



/inference="protein
motif:SignalP:2.0"
/locus-tag="An14g05960"



197606 . . 197698, 
197751 . . 197968, 
198017. .198238, 
198288. .198371, 
198435. .198525, 
198598. .198694, 
198745. .198822) ) 



exon 



complement (197258. .1973 
42) 



/product="unnamed" 
/locus-tag="Anl4g05960" 



intron 



complement (197343. .1974 
01) 



/number=1

/locus-tag="An14g05960"



exon 



complement (197402. .1975 
52) 



/number=1

/locus-tag="An14g05960"



/number=2 



gene 
mRNA 



complement (197553 . 
05) 



complement (197606 . 
98) 



complement (197699 
50) 



complement (197751. 
68) 



complement (197969. 
16) 



complement (198017. 
38) 



complement (198239. .1982 



1976 /locus-tag="Anl4g05960" 



/ number=2 
/locus-tag= 



/number=3 
/locus-tag= 



/number=3 
/locus-tag= 



/number=4 
/locus-tag= 



/number=4 
/locus-tag= 



complement (198288.. 
71) 

complement (198372. . 

34) 

complement (198435. . 
25) 

complement (198526. . 
97) 

complement (198598. . 
94) 

complement (198695. . 
44) 

complement (198745. . 
79) 

complement (<199830 . 
1412) 

complement ( join (<15 

. .200758, 

200827. .>201412) ) 

complement (join (195 

.200758, 

200827. .201412) ) 



/number=5 
/locus-tag= 



/number=5 
/locus-tag= 



/number=6 
/locus-tag= 



/number=5 
/locus-tag= 



/number=7 
/locus-tag= 



/number=7 
/locus-tag= 



/number=8 
/locus-tag= 



/number=8 
/locus-tag= 



.>20 

9830 



/number=9
/locus-tag=



"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05960"

"An14g05970"
"An14g05970"



330. /locus-tag="Anl4g05970" 



/inference="profile:COGS:COG0534" 
/inference="profile:PFAM:PF01554"
/inference="similar to AA
sequence:PIR:T37517"
/note="unnamed protein product; 
Title: strong similarity to 
hypothetical protein SPAC11D3.06 - 
Schizosaccharomyces pombe" 
/codon-start=1
/protein-id="CAK42158.1"
/db-xref="GI:134081903"
/translation="MYDSLPSYRETSSAHTHEEE 



complement (199830. .2007 

58) 

complement (200759. .200! 
26) 

complement (200827. .201' 

12) 

<202677. .>204146 
join(<202677. .203229, 
203273. .203374, 
203431. .>204146) 
join(202677. .203229, 
203273. .203374, 
203431. .204146) 



HTPLLPKQVDPDPDSKSAKSSVSF 

LVEFFRLLKDSIPVILAYTLQNSLQTTSVLIVGR 

TSPENLATTAFSLMFAMVTAWMIA 

LGGTTALDTLASSTFTGSSNKHDLGILLQRAFFV 

LGLFYVPVAILWTCSEPVFLLLGQ 

DPQLSRDSARFLTCLIPGGLGYIYFEAMKKYLQA 

QGIMRPGTYVLLITVPFNALLNYL 

FCYTFRMGLLGAPFATGISYWLSFALLVLYARFI 

AGSECWGGWSRKAFENLGTFARLA 

FLGWHVGTEWWAFEIVALAAGRLGTIPLAAQSV 

IMTADQVLNTIPFGVGVATSSRVG

SLLGSRDAAGASRAANTAAWLSMALGGAVLAVLM

GTRHVFAKIFNSDEGWQLTAEVL

PWVALFQIADGLNGSCGGSLRGMGRQHVGALVNL 

ASYYCGALPLGIWLAFNGWGLKGL 

WVGQCIALYLVGALEWTIVAFSNWEGEVDKAFQR 

MDIHDRLEVGHTTNGATTW" 

/locus-tag="An14g05970"

/number=1

/locus-tag="An14g05970"
/number=1

/locus-tag="An14g05970"



/number=2

/locus-tag="An14g05980"
/locus-tag="An14g05980"



/locus-tag="An14g05980"



/inference="profile:COGS:COG0477"
/inference="similar to AA
sequence :PIR: 86 11 40" 
/note="unnamed protein product; 
Function: TPO3 of S. cerevisiae is
a vacuolar polyamine transporter
that controls the cytoplasmic
spermine content. Phenotype: S.
cerevisiae cells in which TPO3 was
disrupted showed an increased
sensitivity to polyamine toxicity
and a decrease in polyamine uptake
activity and polyamine content in
vacuoles. Phenotype: S. cerevisiae
cells overexpressing TPO3 were
resistant to polyamine toxicity
and showed an increase in
polyamine uptake activity and
polyamine content in vacuoles.
Remark: TPO3 of S. cerevisiae is
also called YPR156c or P9584.7.
Similarity: the predicted ORF is
125 amino acids longer at the
N-terminus than TPO3 of S.
cerevisiae. Title: strong
similarity to polyamine transport
protein Tpo3 - Saccharomyces



/codon-start=1



exon 

intron 

exon 

intron 

exon 

gene 

mRNA 



202677. .203229 

203230. .203272 

203273. .203374 

203375. .203430 

203431. .204146 

complement (<205094. .>20 
7843) 

complement ( join (<205094 
. .207273, 
207361. .207431, 
207517. .207554, 
207700. .>207843) ) 
complement (join(205094. 
.207273,207361. .207431, 
207517. .207554, 
207700. .207843) ) 



/protein-id="CAK42159.1"

/db-xref="GI:134081904"

/translation="MNLKLTQIDLVPVPEKQSAS 

DIDSLPVEQHGTHTHPAPTSDPLD 

PLNWPRWRKHVILGIVMLKFVLPEQSRRTHRLTM 

NRYFLFTYITTTTVPSFAEIQSQY 

DINYSQVNWTVAIPALGLSLGPLFWSSVGDIYGR 

RIVFIVGTVIALVATIGAAVADTY 

GGYMAARFFQGFGVSPSSTVGMAVANGTVTDMFY 

EYERGQKLGLWVLALDSGLLLGPT 

FGGFLNLVSAQWINWFNAILFAALLLLELTLMPE 

TLYPRALMLQRMAVTEKPAESNAG 

IEEAGIKRTKSLPFFNLRPIPGLSHPPIYASLTR

FLLTFRFPVIAVAVIGYSFTWYWW 

ILSVITMVPSAYATDSPLIQGLLFLGLLIGTLVA 

EVSCSGRLSDAIVGRLAKRNGGVR 

VPEMRLWLAYPAIVLTAGVYIPIAHVRTISNIAI 

QPAVGLILWGISIDKAYHWMVGQVAFFLCE"

/locus-tag="An14g05980"

/number=1

/locus-tag="An14g05980"
/number=1

/locus-tag="An14g05980"
/number=2

/locus-tag="An14g05980"
/number=2

/locus-tag="An14g05980"
/number=3

/locus-tag="An14g05990"
/locus-tag="An14g05990"



/locus-tag="An14



/note="unnamed protein product; 

Title: weak similarity to spindle 

pole body-associated protein sad1p

- Schizosaccharomyces pombe"

/codon-start=1

/protein-id="CAK42160.1"

/db-xref="GI:134081905"

/translation="MRNLVYPKYHVLESLNRPCD 

AVDRHSTRVFSRRVAFALEGRQLP 

RIAIDTSELVILVAAIVALASPALDLCLRLQASY 

SVPSSHTNRNIKPILLRIELPMPA 

RRGATRRAGSTRSDIGSASTYFQSKLGPEARTQA 

LPNLPTKQSFAYGSAETPILPREL 

KIQPHMDLTEMADAIDKGIEDAKDRQMKEKETTQ 

DKSRRQKSPSITRSPVRRSRREPT

PDELQLLDNLREATKSPTPVRGNYSNNDQSTATP 

TPPIPHTLSTASSPAQSLPVPRYP 

HVPAENLYPSPMGRFGPQLHDGPPLGSSPLPDDS 

SLYSFTVERAINSDELTRTLSDGK 

NIKAPPRRFSGLAFANEPIHEEEEPDSRLLKTKS 

RSPSLQPSYEDFQIEPSPEPEPQS 

EPESVQELELEPTPEPEPIPELEPMPEPTPEPEV 

IREKSPAAQFTAPTKTLIPNAYAR 



gene 
mRNA 



complement (205094. 
73) 

complement (207274. 
60) 

complement (207361 . .2074 
31) 

complement (207432 . 
16) 

complement (207517. 
54) 

complement (207555. .2076 
99) 

complement (207700 . .2078 
43) 

complement (<208451. .>21 

0658) 

complement ( join (<2 08 451 
. .208726, 
208778. .209568, 
209634. .209834, 
209889. .210489, 
210572 . .>210658) ) 
complement (join(208451. 
.208726,208778. .209568, 
209634. .209834, 
209889. .210489, 
210572. .210658) ) 



RTPSQEPSVDDGQQNIRQTGQSWSWVGSLSAQLP 
SVSTVARILAGIALAAATVYLVAF
GGIPSLSRPPQYIPMDENNMLAVSSLTDQMSRIG
AQVSSLAKEMRTVKWDVNEVQSEV
RSSPTPIMPPSRGSTDLGPPTEQKTNFLSIGLGV
IVIPGLTSPTVGHKLSAWQWAYVN
LWRGSHYRPASPPLAALVPWEDYGDCWCSTPRDG
MSQIGIDLGQKIVPEEVAVEHMPK 
TATLKPENAPREMELWAQYVLVQKGTSRPARTQA 
ERFSIHKPIMDALRSAWPTEDPTA 
YSDDPLLGPTYYRVGKFTYDIHGSHHVQRFELDA 
VIDSPEVRVDRWFRATSNWGGNH 
TCIYRLKLFGHV" 
2072 /locus-tag="Anl4g05990" 



/number=1
/locus-tag="An14



/number=1
/locus-tag="An14



/number=2
/locus-tag="An14



/number=2

/locus-tag="An14g05990"



/number=3

/locus-tag="An14g05990"



/number=3

/locus-tag="An14g05990"



/number=4

/locus-tag="An14g06000"



/locus-tag="An14g06000"



/locus-tag="An14g06000"



/inference="profile:COGS:COG2319" 
/inference="profile:COGS:COG5635" 
/inference="profile:PFAM:PF00400"
/inference="similar to AA
sequence:PIR:S56893"
/note="unnamed protein product;
Function: MDV1 of S. cerevisiae is
involved in the mitochondrial
membrane fission event by
regulating the assembly of Dnm1p,
a dynamin-related GTPase, into
punctate structures at the outer
mitochondrial membrane. Phenotype:
the mitochondrial fission is
blocked in S. cerevisiae MDV1 null



complement (208451 . 
26) 



complement (208727, 
77) 



complement (208778 , 
68) 



complement (209569 
33) 



complement (209634, 
34) 



complement (209835, 
88) 



complement (209889, 



complement (210490 , 
71) 



complement (210572 , 
58) 



<211445. .>212528 



mutants and mitochondrial 
membranes form nets. Remark: is 
also called YJL112w, NET2, FIS2 or 
GAG3. Title: strong similarity to
mitochondrial fission protein Mdv1
- Saccharomyces cerevisiae"
/citation=[77]
/citation=[78]
/codon-start=1
/protein-id="CAK42161.1"
/db-xref="GI:134081906"
/translation="MDKHRRRDESPSGLSDIVEP 
DGLLGTGITSRHIEAFGRKVTSTA 
GHLMGPAPDSSTGGHYHTAMADIQRELRHPNTQR 
KVFSLTQTTPTDLVRSKLSTTEIQ 
SRAISSLPDELLANIPDDSSSYSLFQGFQASQDD 
IEYRRAHRRRSSKSKKLLKDGETR
GALPSAPSDLKKERDLLSRRMELMGVRKNMCSSE 
IHDIDNKIANLHNMRKIVLDRLAG 
LEMEEADLEHELNEIENKLEDIQEEQQEAEVPPP 
ATPKSSEANDDSIVSEDPAMGASF 
MSESIYQKIPSPKSVKQRSIILHEHFAPGSEIKE 
MPAHSDMVTAIDFDYPFGTMISAA
LDDTVRVWDLNVGRCVGFLEGHNASVRCLQIEDN 
IVATGSMDASVKLWDLSRARTTTR 
DIIRVTRREEJEJEESAQADDASMASHSTTLEDCYVY 
SLDAHVDEVTALHFKGDTLISGSA 
DKILRQWDLVKGRCVQTLDVLWAAAQASTLGSET 
TWRPSGRLPDASADFVGAVQCFDA 
ALACGTADGMVRLWDLRSGQVHRSLVGHTGPITC 
LQFDDVHLVTGSQDRSIRIWDLRT 
GSIFDAYAYDKPITSMMFDTKRIVAAAGENWKV 
YDKADGHHWDCGAGVGVDDSGPQP 
ATVERVRLKDGFLVEGRKDGIVAAWTC" 
.2087 /locus-tag="An14g06000"

/number=1
.2087 /locus-tag="An14g06000"

/number=1
.2095 /locus-tag="An14g06000"

/number=2
.2096 /locus-tag="An14g06000"

/number=2
.2098 /locus-tag="An14g06000"



/number=3
.2098 /locus-tag="An14g06000"



/number=3
.2104 /locus-tag="An14g06000"



/number=4
.2105 /locus-tag="An14g06000"



/number=4
.2106 /locus-tag="An14g06000"



/number=5

/locus-tag="An14g06010"



exon 

intron 

intron 
exon 
gene 
mRNA 



join(<211445. .211515, 
211743. .212394, 
212451. .>212528) 
join(211445. .211515, 
211743. .212394, 
212451. .212528) 



211445. .211515 

211516. .211742 
211743. .212394 
212395. .212450 
212451. .212528 

complement (<212729 . .>21 

3754) 

complement ( join (<212729 
. .213283, 
213342. .213578, 
213671. .>213754) ) 
complement ( join (212729 . 
.213283,213342. .213578, 
213671. .213754) ) 



/locus-tag="An14g06010"



/locus-tag="An14g06010"



/EC-number="5.4.99.5"
/inference="profile:COGS:COG1605"
/inference="profile:PFAM:PF01817"
/inference="similar to AA
sequence:UniProtKB:AF133241.1"
/note="unnamed protein product;
Catalytic activity: chorismate
mutases convert chorismate <=>
prephenate. Function: aroC of A.
nidulans is an allosterically
regulated chorismate mutase acting
at the first branch point of
aromatic amino acid biosynthesis
to generate prephenate. Induction:
tryptophan acts as heterotropic
activator of aroC from A.
nidulans. Repression: tyrosine acts
as a negative-acting, heterotropic
feedback inhibitor of aroC from A.
nidulans. Title: strong similarity
to chorismate mutase aroC
- Aspergillus nidulans cytoplasm"
/citation=[60]
/codon-start=1
/protein-id="CAK42162.1"
/db-xref="GI:134081907"
/db-xref="GOA:A2R3Z4"
/translation="MDTAIDLSDASKALDLANIR 
FQLIRLEDTITFHLIERAQFPLNK 
PIYLSGGVKIPGTDLSLLDYFLREQERLESRVRR 
YQSPDEYPFFPDSLEEPILAPIKY 
PKILHDNDVNVNSTLKQRYIEDILPAVCAQFGRE 
DRGEAQENYGSAATCDVHCLQALS 
RRIHFGKFVAEAKFQQETERFVKLIKANDRKGID
DAITKPEVELKVLERLALKAKTYG
TDPGFQPENGPKINVDAWAMYKEYVIPLTKWE
VDYLMQRLKGTQWE"
/locus-tag="An14g06010"



/ number 
/locus-tag="An14g06010"
/number=1

/locus-tag="An14g06010"
/number=2

/locus-tag="An14g06010"
/number=2

/locus-tag="An14g06010"
/number=3

/locus-tag="An14g06020"
/locus-tag="An14g06020"



/locus-tag="An14g06020"



exon 

intron 

gene 
mRNA 



complement (212729 . .2132 
83) 

complement (213284. .2133 
41) 

complement (213342.. 2135 

78) 

complement (213579 . .2136 
70) 

complement (213671 . .2137 
54) 

<214408. .>214606 
join(<214408. .214444, 
214506 . .>214606) 
join (214408 . .214444, 
214506. .214606) 



/inference="similar to AA
sequence:PIR:T41474"
/note="unnamed protein product;
Similarity: the ORF shows
similarity to putative cdk
inhibitor p21 binding protein
TOK-1 of H. sapiens. Title: strong
similarity to hypothetical cdk
inhibitor p21 binding protein
SPCC613.08 - Schizosaccharomyces
pombe"

/codon-start=1

/protein-id="CAK42163.1"

/db-xref="GI:134081908"

/translation="MVKRKELKDNDVEMSGTDPR 

VDGDDSDEDMDMVNVDFEWFDPQP 

AVDFHGLKNLLRQLFDTDAQIFDMSALADLILSQ 

PLLGSTVKVDGNESDPYAFLTVLN

LQEHKDKPVIKDLTAYLQRKANAVPTLAPLAQLL 

SQTPIPPIGLILTERLINMPAEW 

PPMYTMLQEEIEWAIKDKEPYNFSHYLIVSKTYE 

EVESKLDAEESRPQKKKKKAAGGE

KAERFLFHPEDEVLERHAVCVGPVEYTHKAEEGL

SDAKRAFQDLGIVTKGSLILLEAS

KLDGAVKDMAEYFKP"

/locus-tag="An14g06020"

/number=1

/locus-tag="An14g06020"
/number=1

/locus-tag="An14g06020"

/number=2

/locus-tag="An14g06020"
/number=2

/locus-tag="An14g06020"



214408. .214444 

214445. .214505 

214506. .214606 

complement (<2 16 128 . .>21 
7084) 

complement (<2 16 128 . .>21 



/number=3

/locus-tag="An14g06030"
/locus-tag="An14g06030"

/locus-tag="An14g06030"

/note="Remark: the predicted ORF
is only 45 amino acids long."
/codon-start=1

/product="hypothetical protein"

/protein-id="CAK42164.1"

/db-xref="GI:134081909"

/translation="MPFAQNGDAVTKADYQVEEG

IKYLREVKTTLHPFWKAPSVGSYT"

/locus-tag="An14g06030"

/number=1

/locus-tag="An14g06030"
/number=1

/locus-tag="An14g06030"
/number=2

/locus-tag="An14g06040"



/locus-tag="An14g06040"



complement (216128 . .2170 
84) 



/locus-tag="An14g06040"



exon 

intron 

exon 

gene 

mRNA 

CDS 



/inference="profile:COGS:COG0599"
/inference="similar to AA
sequence:PIR:T39714"
/note="unnamed protein product;
Similarity: the predicted ORF is
96 amino acids longer at the
N-terminus than the hypothetical
protein SPBC17D11.03c of S.
pombe. Title: weak similarity to
hypothetical protein SPBC17D11.03c
- Schizosaccharomyces pombe"
/codon-start=1
/protein-id="CAK42165.1"
/db-xref="GI:134081910"
/translation="MARNPSRLLPPSRSLQLTTS 
KRQFGTGASILPLLSPPRYRPLRS 
SLISSTPSSSPLAHRIPTFATPLRTITTSSNSAP 
TKMTTTIDPKYAQLFQSLESQFQT 
TTLPHDKWYILAISTLVANPDPERADQLYLYLTS 
KPEYSTPSSRQDLIRRIREALIKS 
VIIVGVCKPIEAILAISKLEAPEDKDYTFTRENW 
QCDEANHERGVAWLEKLYARNTSG 
TLDLFAAHKDFAWLSKEITYGLFLSDRGVLDDLD 
TQLWLPAIMSQNLKIETHWHIRG 
TRRLGVSLEDVRVLCEGVKHVAGFYGRVLDKVPS 
VEEVEGDV" 

complement (216128 . .2170 /locus-tag="An14g06040"
84)

/number=1

<217593. .>218104 /locus-tag="An14g06050"
join(<217593. .217931, /locus-tag="An14g06050"
218003. .>218104)

join(217593. .217931, /locus-tag="An14g06050"
218003. .218104)

/note="unnamed protein product; 
Title: weak similarity to 
hypothetical protein B10H18.060 - 
Neurospora crassa" 
/codon-start=1
/protein-id="CAK42166.1"
/db-xref="GI:134081911"
/translation="MSSSGAGQSSVGSRAIYEAG 
DQRNVPQSEINEQNRYAEGQKNSH 
KNLDSSMPPSSLLPSNHSGDIVLTNTKTEDQRSI 
GNKLASQERKPESDHHHNFDQNPE 
AELSKQDPTKPAKVHGNEPSKGAKIDAELQAEDE 
QRLREKGIKK" 

/locus-tag="An14g06050"

/number=1

/locus-tag="An14g06050"
/number=1

/locus-tag="An14g06050"
/number=2

/locus-tag="An14g06060"



217593. .217931 

217932. .218002 

218003. .218104 

complement (<218542 . .>21 
9708) 

complement (<218542 . .>21 
9708) 

complement (218542 . .2197 
08) 



/locus-tag="An14g06060"
/locus-tag="An14g06060"



/inference="similar to AA

sequence:UniProtKB:SC05211.1"

/note="unnamed protein product;

Function: TTP1 of S. cerevisiae is

involved in the transfer of

mannose residues to the mannans of

glycoproteins during maturation in

the Golgi. Function: the correct

mannosylation of proteins by TTP1

of S. cerevisiae is essential for

growth and morphogenesis. Golgi

Phenotype: the N-linked sugar

residues in the S. cerevisiae TTP1

null mutant lack the main

alpha-1,2-linked branches of the

mannan. Remark: TTP1 of S.

cerevisiae is also called mnn2 or

YBR015C. Title: strong similarity

to alpha-1,2-mannosyltransferase

Ttp1 - Saccharomyces cerevisiae"

/citation=[43]

/citation=[56]

/citation=[57]

/codon-start=1

/protein-id="CAK42167.1"

/db-xref="GI:134081912"

/translation="MEESHAGYIEECRKAERLRP 

VHTPGTRGIVSTAGASYFPVFLSS 

LRMLRRLGSTLPVEVYMKDKSEYEKQICDDILPD 

LGARCLVLSDIVGKGAIEHYQLKI 

FAVLFSSFEEVIWMDADCFPLHKPEVLLESEPFT 

SKGLVTWPDFWISSASPLYFTISR 

QEMPALSERASSEAGVFLVSKKTHQMTLLLAAYY

NYYGPSHYFRLLSQGAPGEGDKET 

FLHAASAVGEPFYAVSERVQAVGHTKPGGIAGSA 

MVQTDPAEDYALTSAGKWRVQDES 

VAKAPRAFFIHANYPKFNPGEKVFGMKWETTPTL 

RPDGTDGRAWLVAESTVQRFGYDV

EKAYWEEIKNISCDPAISFRTWERKDEVCDRVES

YWANVFAEPHDDDPKFTDES"



gene 
mRNA 



exon 




221930. .224173, 
224489. .224622, 
224729 . .224737, 
224926. .224987, 
225102. .225280, 
225490. .225530, 
225672. .225708, 
225858. .225905, 
225959. .225993, 
226055. .>226148) 



CDS 



join (221804.. 221887, /locus-tag="Anl4g06070" 



221930. .224173, 
224489 . .224622, 
224729. .224737, 
224926. .224987, 
225102. .225280, 
225490. .225530, 



225672. .225708, 
225858. .225905, 
225959. .225993, 
226055. .226148) 



exon 

intron 

exon 

intron 

exon 

intron 

exon 



221804. .221887 
221888. .221929 
221930. .224173 
224174. .224488 
224489. .224622 
224623. .224728 
224729. .224737 



/inference="similar to AA
sequence:UniProtKB:NCB7F21.6"
/note="unnamed protein product;
Title: similarity to hypothetical
protein CAD11325.1 - Neurospora
crassa"

/codon-start=1

/protein-id="CAK42168.1"

/db-xref="GI:134081913"

/translation="MEPTSKARIRKTPPVSAPAP 

RKRRRRTVGTPCDRRRPFCSQCLD 

SARNCAGYKTALTWGVGVASRGKLRGLSLPVTGA 

QPAVAPPKLPRATGPRAVSKSQPA 

LDTAAEKRRLHDKQRSSAAGLTEHGSVSIPPTSQ 

RRLCSPEYYTVDSHVKPNHHNPYS 

SRATKTDSLKQYHAPLVANPDSSRAWMGLEQTPV 

PIQCPWPALTPEQKDNASDEQHEE 

TSYNEVSPRMSACQPPSLSQQLLARSVGRTPRLR 

YLISYYAEVIAPMIVAFDSPTNPF 

RTYILRLAQESTSLQEAIATLASCNLRQRRERGT 

RSTERTLPGRLSSMAHQALTDGAL 

QDRTGIFMPEDYAQEEQYHRGMAVSALNRELADS 

HQRLSDSVLATLLILCLFHGCDTG 

VAQFKTQFAGVTKLLAIRLCNSPCMSDELKWFVR 

MFTWIDTMTATTNDREVQLRGACL 

DITAASDGEWGLENLAGCDAGLFRIVAQLGRLNL 

LSQNQEVRTSTPPDIHIPSTTLPP 

SMAFFPSNIPATSSGPFAFTLPPLPPPGDRDRGQ 

LPPAFWAEWYSLRQKLESWRFVPH 

QSQSQSPLFTSSHAYISPPSSPTDQPWAAHNLK 

DVIHISESFRHAAILYSERLAYPD 

LPSDHPRIQNIVQHIMSHIMTVQSDAYLLWPLFI 

TGSECVRADHRAIIRQRCKDLSRD 

SGFFNNLSCLQLLEKIWAENPAVDVIPWSSLDF 

GLQAAASGCVSEVTFLPPGSWASS 

QHEAFMSPTLPSVSRKQGFRWHEAMQTKRAEDEY 

MTNSIDDASTPSGPSSNLPHQPTQ 

TNHKSSGTGHLPSASTSENLCWRMSGNSSGSGSD 

GIATGGLNITIYIRFRHPHHRIDG 

RKIETHKRIAKRDSDSSPQDKRTDLQRDPNLLLL 

LRIEDEGFTLHHQLNKGWGVGLG 

WRLRTLIHCGGGGGGMGIRDDFIKMIWRLLVFW 

RCDFESVAESQCNIYYGGKYSNND 

IILLFHEVPKFRWYNM" 

/locus-tag="An14g06070"

/number=1

/locus-tag="An14g06070"

/number=1

/locus-tag="An14g06070"
/number=2

/locus-tag="An14g06070"
/number=2

/locus-tag="An14g06070"
/number=3

/locus-tag="An14g06070"
/number=3

/locus-tag="An14g06070"
/number=4



intron 224738..224925 /locus-tag="An14g06070"

/number=4

exon 224926..224987 /locus-tag="An14g06070"

/number=5

intron 224988..225101 /locus-tag="An14g06070"

/number=5

exon 225102..225280 /locus-tag="An14g06070"

/number=6

intron 225281..225489 /locus-tag="An14g06070"

/number=6

exon 225490..225530 /locus-tag="An14g06070"

/number=7

intron 225531..225671 /locus-tag="An14g06070"

/number=7

exon 225672..225708 /locus-tag="An14g06070"

/number=8

intron 225709..225857 /locus-tag="An14g06070"

/number=8

exon 225858..225905 /locus-tag="An14g06070"

/number=9

intron 225906..225958 /locus-tag="An14g06070"

/number=9

exon 225959..225993 /locus-tag="An14g06070"

/number=10

intron 225994..226054 /locus-tag="An14g06070"

/number=10

exon 226055..226148 /locus-tag="An14g06070"

/number=11

SEQUENCE (SEQ) : 

1 tttcgcgtcg tcgttaggca tcatgatggg ggtcgaaaat gcctctggtt ccactccgga 
61 agctaaccat caccccagag cacagttcct acaccgtaga ttcagttgct acgggaaatc 
121 ttgaacagaa gttcttagca agacccaggt agtagcagct aattactatt agtagtatta 
181 agcttcagca acaccaagga aagaaagaat gtaaaaagaa aagctacctg cccgcatgaa 
241 tgaaccacca gcttgtaaca ccaggatcat tttatcgctg tattttctag aagccatggc 
301 cacaattgcc tattgtcatt tccactttgt atagtactag caagtatcat catcaccggg 
361 aagcatcctg agtggatggt gaggcgacta ctatgatgat caccggacag cattggggga 
421 agatgatcaa catcaatgct tgccccgagc gtggggagat gccttgaacc aactaaaact 
481 aactatttcg gatcaatcac ctgtcaacca tccatcatcc tcctcctcct cctccaccat 
541 catcatcatc atcatcatca ctacaatcca gactagtgcc ataccaatga atcaatccct 
601 caaaaagtac aggcacagaa tggctgattt atatttcgaa ttccagtcat cgggatctgt 
661 cacgggtaca gccgtgttgc ccgtggcact actaactatc tgattgagag tatctgctga 
721 cagggataga ttagttagta atgaaacttg atgggcgatg aaattgcatt aaattccacg 
781 ctgagacctg ttgcgacaca tttacccctg cgtctgtctg tgcgtgcttt tgaaaccata 
841 cccaacaccg cgggtttatc tactgtgcat ctacatactc aatatatttc tcagagtact 
901 acatgccacg ccagggagat acacacagaa aagagaggga gagggagaat gtgagtgagt 
961 gattcattgt gtgttgccac cacatagacc acgtgagaga ccatgataaa aaaggagtct 
1021 tcttataatt cacaccatcg aaaatggctt gcatcacatg aaggcccggg cccatgatcc 
1081 ggtgccaatc actggcgggc tgaaccatcc attcgacaac ctcagagagg cttactcagt 
1141 aagcaaatga tgtctctctc tttctctcgc tgccccgcaa agctaaatgt caatgggcag 

1201 cgcaaaagtg cggaggagcc cacgagatcc ccgtccatag cacggcttgg tatcgattca
1261 tccttccttt tgaatggtcc catgtggatt gctcaggaca gatctgcccc cttccgtgcg 
1321 tcggttgtga cgcagtcgta cagattcaga tatatacaaa agtagtgtat ttctctgagg 

1381 gatacttgga gactatcccg gtctctacgc agtatgcagg cgatggaatg ctgggggaaa
1441 ggagaggccg gggcctgggt tgccaccgtc agacagaggc tcctggaata gcctgcagaa 
1501 aggatgttcc atgttccgga cgttttattt gatctcagac cgggaaaaac ggttgccaga 
1561 gagcatcggc attgcttttt tttgtttttt cgctctgcga ttcggggggc caatttgata 
1621 cttgctccac tccccggagt tacttaagat gcctgcagtc aatcaaccga cctgcagtta 
1681 gagaacaact ccgaacggga agaattttct attcctaggg gtggatatag atactactcc 
1741 gtacttcgtg acgaactgac ttgcgagttg acgctaaatg gaagcccagt ggaatcttct 
1801 caggcgagca accatggtag cccgtgtttc ggggggcgca ggctcactcg tccaactaat 
1861 aacaggaccc tcactgtaat tcgccgctac gccctggatg gatggagttt ctggatagac 



1921 atggtgcgat gcgcgccccg gctcctctcc tctcctctca tctccaagta tcaaggctgg 

1981 tccctcttcg aatggggatt cggtcgtctc tagttcgtag aatagtacga cttgctgcag 

2041 acctgcgcat tccgtttgcc ccacagagag aagatcttac catttgactc gcatggttgc 

2101 cattctgttc cggcattcca accccaaggg agaaagaaga agaagaagag aaatagcaac 

2161 actttcccac cagcctcgac ttggatattt ttctcctccc tctcgctatt ttcggcgcgc 

2221 aatttatctt ttagttcctc cctttccttt tcccatccct tcttgtccct tagaactggg 

2281 tcaaggataa tttttttttt ttccctttta attttccatc accccctaca tcactggccc 

2341 tgatcgtctg ccctgattcc caaagagacc cagatttgcc tccggatcca ccttggaacc 

2401 attgcaatgc gccgaagcga cgaacaactc taaagctcac tcatctatcc ccgacttaac 

2461 ttgaacggaa gaattcgagg actgtcagaa aaataagatc ccaatatagg gctgtcgcgt 

2521 cgatcgcacg aggtgacaga caaccagaaa gtggacaagg acagccagca gcatagcgca 

2581 ttcgagaagc gatcgtctcg tggactttct tccccgatca gctaactaac ttaacaccga 

2641 ccgagttgtc actctcgcaa aagttgataa gagagaggag ggagagaaag agagaaagga 

2701 aacaaaaaga aaaggatctt aaaaagactt gataaatccg atcccttccc actggttaag 

2761 gaagttaatc acgcgaagca aaccccgaac aattattcac tttttctttt cattttcttt 

2821 tctttttttt cctcctcccc atcccaaacc catgtctgtg tctgcatccg tttccgcctc 

2881 cgcctcctcc aacctcccat ccgattctcg ctcctcagga cccctccgtc gcctgagcca 

2941 actgcgtgcc tatactcagc agcatttctc ctcttcatcc tctgctaccg cctcctcctc 

3001 gcacccttct gcctcgtcat cctcgtccaa cggtccccgc atctcgcgtc gtcataccct 

3061 cagcagtcga gtttcctggt tttctcccgc ctccaccggg accgaatctt cgcaactctc 

3121 gcctgtgggg tctcgctcgt cgggcccatg tcccgagagt gagcagccat ccactcttgc 

3181 ccgctatagt gcggtttttt tccccagcta ccgcggcctt gagctcgact tgcgttcctc 

3241 ccgccagtcg gacctgtcca gctcttcccc ttctaaccac tccggtggcc gtgagacgac 

3301 aactcaaggc acgaccatgg cgcggcccag aggtgcctct caggcgctgg atgcgagacc 

3361 tgatctcgat gctaccactc catcggcctc caatctccac gccggagctt tggactcgtc 

3421 cgacccggtg atgtcggaga acccgtcgac tgcgtccccg cgaccgaagc agaaagcgac 

3481 cattcgcttc ttcccccatc aagactccca ccaaagctcc cgcccttcct tacctttcat 

3541 ccccgtctcg cgaacgcttc cgtccgaaag ttgcgtcatc cgagttggtc ggtactcgga 

3601 acgcgatggt ctgccggtgg ccaaccccac ggacccctct gatgctccgg tcgggttcaa 

3661 gtcaaaggtc gtgagtcgga aacactgcga gttcttatac ctgaacggcc agtggcatat 

3721 caaggatgtt ggaagctcgt cggggacgtt tctcaatcac atgcgcttga gccagccgaa

3781 catgccctct cggctgtata ccgtgaagga tggagatatt gtgcagctgg ggattgactt

3841 tcggggaggc gaggagatga ttttccgttg cgtccgtata cggatcgagt gtaatcggtc

3901 ctggcagcag cagccgaatg aattcaagta tgaccatctc tggcgtcgaa gctgctttgt

3961 gtggaatacc ggctaatgag tgatctagta aaaacaccga gagtctgatt cggaatctgg 

4021 gcaagggaga tacagcggat tattctggat gtcgggagtg ctccatttgt cttggctctg 

4081 ttctggtttg ttctccctcc tctggcagat cgcacaactg cgggcagcat agctaattct 

4141 gtttgaagcg gccttatcag tgtttattca tggcagcctg tgcccacgtc tggcattaca 

4201 aatgcgttag tcgtcttatc cacacgccag actaccccat gttccaatgt cccaactgcc 

4261 gtgcctacac cgatttgagc gccgaggtcg atgatacgaa cgactttgac gaggaggacg 

4321 aaaagaagga cactccagag gacaagcagg ataccaccga tgcatctcgg tcacaaacga 

4381 attcccccca gttggaggcc caaccagccc cttctggcga cgaaccccgc caggataatc 

4441 tgccggccga ggcaggcctc gctgccaata ttgagagcat gcgtctgcaa gacacagatg 

4501 catccgatga tgcgcggcgt ccccgcgctc ccacgtccaa taatgatggc gcctctgcca 

4561 atgccgataa tgaagtccct ggctggcagt ccgtgaccca gtcccccaat gctctgcaga 

4621 cccgtcagtc ccacctgcgt gcggacaccc ctgtccggtc ggagtcgtct gatgacaatc 

4681 ctctgacccc attgaatgat tcagggcccc ttgcccttga cggccgggcc gccatgcctt 

4741 gaagcaatga ggagcgtggc cgcgacatgt ttggatagat cgctccgagc tcctcctcga 

4801 cagctgtatc tggcttctag aattggacaa cccttgcatc cagtctactg gactgagtca 

4861 ttaatctgtg cattcctatt gtcgccttgg tttgatgtct tttgtctttt gcattctcga 

4921 gccgtatgtt catgttccat tgttattagc ttcttcttta gtcccagcat tgagcatcgc 

4981 gattgttctt cccccttgct tccccattta ttctctaaca ccccacctat tcaacctatt 

5041 ttatacccat ttgcagcatc ctttctccct cacacacccc ccacttcttc cctctcggca 

5101 ctggctactt caccatcaac gcaatggacc accctatcac ccatgccatg ccatgccttt 

5161 cgaactcaac cccggcttgg agtacacaac gggagtcatc ttatttggaa cccggaggtt 

5221 ggcgtttgca gtcagtcagt cagtggtagc cacccccctt cttttattct gtctttggtt 

5281 tttttcggaa cggtgattca ttggcgcgca tgaatgggtg gttgtgagat gtacatacca

5341 gctatacata tactctatca tatttttatt tcataaccta cacctataca tcatcatcat 

5401 ccttgtctca atttaggcag cctcctgcaa gatcagtttc cttgtcccca tgacacgagg 

5461 tgacatgaaa acgcagcatt catgatgctt ggcggaaggg ctgttcaacc cacctaactc 

5521 actaaccaat tcgtcacctt caatggtact attagtagta tctcatccag gtagataata 

5581 ttagtattac catacagtag taccatacca taccactcat cctgaatgag taactaagca 



5641 tctatttcct aaagtgttct atatcaccat gtaaaaatac ttatctccga aatactgcta 

5701 cttagttatg gcgtagtgtc cgttacacac gtcatggacc agattactta cgtactgagt 

5761 gagtagttcc gataacagtt tttatttacc tgctttacca ggctctttac tttttaatca 

5821 cgtggtatat agtcaccccc tcccttttga attggatcag tatcacctag ttaactagta 

5881 gtacttacta gtacgttccc gcacacatat gcataatatg tataactagc aatcgcttag 

5941 taccgagaac ggtgctgaca ttttttgagt gtataacatg actttcattg aattagttca 

6001 acatgtttga aaatacatag tagtccttac tattacgaat gactgttgta agagtactac 

6061 tagcattata gaatagatat tactttcaaa ctttgaatca agacgtaagc aaggcaaaag 

6121 taagtccgaa aacatttctc ttttcatatg atgaattgat ttatttattc ctcactcatc 

6181 aatctgataa tagtacaacc agtcattatg tcacataaga aaaaaacaca aaaagcaatg 

6241 agtatttcgc tggtatcaat catctcatct catatcatgt tcaggcctag tcaagtctct 

6301 cctggcccac attctttaaa tgtcatcatg ggatattcat caacgcaagc atttatgttt 

6361 gcccgcttct ctaataaaca tcatctctga gctcaactaa gccctcaaaa cccacgctgg 

6421 gtgcccaggg tcccgcggtt atcgaggtcc gagtcgacgc ggggatcgag cttgttggcc 

6481 atgttcgagc tgtgaggacc ggcagtggtt tgaccgggag cgttgtagct gctgctggcc 

6541 ataccctggt gacgggcgcg gttgtccatg tctgagtcga cgcggggatc catcttgttg 

6601 gccatgttgg agccgtgagg gcctgcgttg ctgctgccac cgtagccgct gctggtggtg 

6661 tagtcgttca cgccgctgcc gccgtagcct gcaccgccgc catagcctgc acctgcaccg 

6721 gcacctccgt agccagcgcc agtagtgtgg gtaccggagt agccgccagc ggcacggttg 

6781 tctgggttcg gtgttagtat ggtgttagat ggtataaagg tggatgggtt agggtgtacc 

6841 tctgtcactg tcgacgcggg gatccatctt gttggccatg ctggagtcgt gagggccgtg 

6901 ggcaccaccc atggcttggt gacgggcacg gttgtcgcgg tcactgtcga cacgggggtc 

6961 catcttgttg gccatgccgg actcgtgggg accggaagag tagccaccgg aggagtagcc 

7021 ggcaccggag ccgcccagac cgccagaggg attggcacgg ttgtcacggt cactgtcgac 

7081 gcgggggtcc atcttgttgg ccatgctgga atcgtggggg ccgtgagcgc cacccatggc 

7141 ttcgtggcgg gcgcggttgt ctagggaatg ttagtatcgt atggagggga acgattggac 

7201 aagggatagc ttaccgcggt cgctgtcgac gcgggggtcc atcttgttgg ccatgttgga
7261 ggagtgggga ccagcgttgg aggaggtgtt ggaggtgttg gtgttgctgg actccttgtg 
7321 gtggccagtc acggcatcct taaccttgtg cataagtccg gacatgatgg gcaattgtat 

7381 gtgtgtttgg tttagagatt gagaataaag tggtcgataa gagagttggt aaagttgata
7441 gttgtagaca gtagatgtac taagtcaatt attgtgttga tgatgaggat cgagatttct 

7501 tggagattag ccaggtttaa ataaggagtg ctgagagtgg tgagtgaaga gaggtctggc
7561 catgacgtag ggatacgtcg agtggccgcg ttcactcctg gacgtgccct ggtccggcaa 
7621 agtgactcga tagcggttga gtgaatgaat gatcgagttc atctccctca ccatagcggg 

7681 ctggtaggat gacgttggga tgtgctcatg tccctcagcc ctaaagcagc gatgaatgtt
7741 ggtcctttgt cattgtcctc tgttaagcag catcgaaagc ggcagggaat agcttccacg 
7801 gatgtcgatc gttcctaagc gggacgttgg aggggtgagt ggctccgcat ctttgtgtta 
7861 aggccagact acgtttgtaa gaagtcgtct atttttgtct ccaaccaggc tgtctagcca 
7921 gttactgcct cgagtccaag tctggggttg ctggagatga gcatccagtg aagatgagtc 
7981 agcaaaccat ggcggatgat gacctcagcc agtgccgtcg atgtcgtcgg ccaagccaag 
8041 atgcagggta gttattggct gcagctgtaa aattgcggga atcaggggag agccagaccc 
8101 actcacaccg atccactaac tgcgggtcca gatggtgcct ccgatcgaat ggggatcact 
8161 cttccccctc catctggcat gccagattga gctgcacacc ttatttttta ggattatttt 
8221 agtatcagca ctagccaact gcactggagt acctctgcag tacaagggag tcgtgtctcg 
8281 cacagatatt tggtggcgcg gtctcgtata tggctggcgt aacatgcaca tggtaaagaa 
8341 tcctatagtt cagactcctg ccggcttaca tcacggattg acccgatcca agatgatgac 
8401 caccccgttc cgtgggcgga agcatatcac tgccgttttg atacgattcg gtcgccaagg 
8461 tctcaaccag gcatgatgca atctgcgtga atcaagctcc tcatagtgct ctgtacgttg
8521 acgatattcc tactttagca gttcggtctg ttttggacgg aatgtccaag agctctttgt 
8581 cctgttcctg aaattgcttt gctggcaacg ctggggtgat tccttggaag gctggagacg 
8641 taagcggctg tcagatctac atccgcccgt ccactggagc gcagagttgt gagtggcttg
8701 catcgctagc agggctggca gaacggtcta gttcgcaaca gcatgtatct ccgtgtagac 
8761 gttcgtcatt taatccgacg taaaatgagg accgtctatt gcaaacaagc taagacagtc 
8821 tgagctcttg gctattctag catcaagcta ggtccagagc agtgtggctg atgtgttacg 
8881 gaattaggcc tggaaaacct cagtgctgcg ggttcaatgc tgggttattg cgccatctta 
8941 gtcttacaat gtctttcggc gacgtcctgg agacgagttc gcttggtgtg gcctctacga 
9001 caggtccctc atggactggc tgtagctcgg gcgttgctct agttgcagcg aagatctcaa 
9061 gttctgtagt ggctgttttt gcagctctag ggtcactagc atccaagagt agtttgatga 
9121 tgcgctcgtt cccatactca gtggcgtaag tgagtggtgt aaaaccatct tggtcctcca 
9181 gacttgagtc cattttgccc gtctcaagca gtggcaacga tcttttcatg accagcttta 
9241 caggcaaagg ccagcggtgc ccgaacatga ttgtctcatg tgctaggatc aatcgccggg 
9301 tactcgagca gtaacctcac gattcaatca tgtccaaagt gcgatgcgtg tgtgagcggc 



9361 gtctgaccac cagcattctg acagtttggg tcaggtattc cggtacggag gagtaacttg 

9421 acaattttgg tgtgccacat ggatgcagca tgcaatagtg gagtgtgtcc agacttgtcc 

9481 ctagaattga ggtctttgac gccaaagtcg agcagtatcc gaataatact ctcgtggcca 

9541 cggtgtgctg ctggtgagat cagattcatg cctcgcttac aagggggtgt tgttcctggg 

9601 tctcatcctt gaagacctag tatcacacca tactcttctc tgaactcggc cggagctcgt 

9661 ctactagata attagccaag gacggatttc ccacagccag gatcggctga aacccagaga 

9721 agacccgacg catggtcttc tctccacttt cggaaataac ggtgattagt gaaccactag 

9781 caggtcccat taatccgttg aggagttcta tccttagcgt ctcgacaatt gcgggtgtgt 

9841 ggccgtttga ggacgttggc tttccactct ggaagaagtt cttcgtgcag tcgcgtatct 

9901 ataataaggt taacttatct catctttaat ggcctcgctt caaatgattc gggctcatac 

9961 gcagtggctc tacgtacgtc agagtatcct tagctaaagc tgcagccgcg cctgcggcat 

10021 atgcatgcca gtcatcgttc tggtgagagt cacagtaatg caaactgcca cgaatgacaa

10081 tgcatggaga ctggtccatt agcccggcag ccccatttca cagcatataa caccaccata 

10141 ttcagtggcc atcatgtccc gctttctgct attgtttatc atgcgattac ccgatgctat 

10201 attaccataa aagtcgcacg actcattaga cggcattcgt ggacgccatt gaatggcttc 

10261 ggttttatca caagaagcac agctgcctcc accttcaccg tgggtatagg cggctggata 

10321 caaggtatcc gtttccgaca atggtcgatg atattctctc agtccgggat gattgcgcag 

10381 tatgtccgac agaaacttgc gacgcgattc gaccgaacac ggtgatctga ctgcatcatg 

10441 gtcatcatcc ctctctactc agtcggagga ggacaagata ttcccttcct tctgaacccg 

10501 cttgtgatct ccttccccat atcgtattca acatctcctc ccataatgcc atctggggca 

10561 ctcacaacca cgatccccta gacgaatatc gttgttattg ttgggaatac cccggccaac 

10621 gcccaccaac aacgaagctc gatcgctggg aaagtgcgag tcgggtggat ggcgacagct 

10681 gctgtggatg ccgttccctg tgatcccttt ggaagtgacg cgagcaccac attgtgccca 

10741 ctcaaggatc ccagtacata tgtattcggg tcatatcagt ctctcagaag acatgtatgc 

10801 tcgtcgtcaa gcatatatcg tgctgtactt agtccgacct cggggggcat attaatgcgg 

10861 ccgtgtatgc gccatgccag agacgcagtt ttggaatact aggcacaata tctaatttct 

10921 taatcaaaca gcagataatt cgaacttctc ggcggctatg aaactctgag aggtctggag 

10981 ttccttgtga agtgcggaga ggaaagagat ggcccagttc agcctcgcaa tggtggtttg 

11041 cggacgcttc catgcaccaa tgaacggcaa agctcataga atcaattaga cgaacaatgc 

11101 acgaaccact tggattatga tcgtattcga cgtgaagctg atcctgccgc tttagcaagc 

11161 gtgctagcga ggaggtcaaa catggccgga tctccgccaa atagctttcc aggcggcaac 

11221 ggcctttcag aagagtgagc catagatgcc gaagaaaagc gagcgacttg ttcttctcag 

11281 attcgaaggc tggctgccag gcggcaagcc gacgaactgc aacgagacga ttggctcccg

11341 ctagtagata gtcgcaacaa gaataagtct gtgctactcc taaggcgccg aaacacgatt
11401 gtttataagt cgaatcattt gaagcatagt attgggcttg aagttctttg aaatcagtag 
11461 tagctcccac gtattagtag tagtcttgac tgtactttcg ataatgagtc caagcttgtc 
11521 cggagactaa atatgtatcc acccatcttc actaatagca cattatgcac tcgaagcccc 
11581 ctttgtgggt gacatgggtt tcatcttttc gtgctatccg agttggagat ctcatcgtgg 
11641 tcaaaccagg tcaaagtacg tggaagtcaa ctggttggcg gagggatgag agagaataga 
11701 gttctgggcg tggcagcccc gtgaagatga attcaccaac cacaacatcc ggcggctggt 
11761 tgatcccccg cgactcgtcg ctactttctg cacgatgatg ccactcctta tctcgtacca 
11821 tttcccaaga gtctcatatg ataatggctt tcgggacaag cgcactaatt ctagagagcc 
11881 cttcagcacc gctgtctcgc cctatcgttc cacctcgact aagcaactaa cccaagtttg 
11941 acaaagcgga cgggctgttg accgctagac gcaaattctt tcacactgtg tacaaagacc 
12001 acctcatctg tttctgtact ctcctccctc agtcagattg aattggaggc aaaatgcgat 
12061 tgaatttcag tcaacccccg acaccggcgt tgagtcggca gaacaccacc ttcctcaaca 
12121 atgcgcatca ctatcgcgca gggacgcatc cgcatctcct gacgagtcgc agcaagcctc 
12181 gaaggtaaac cgatatcatc actcatcaaa ttacacccta ctatagcacc atgctaaata 
12241 cgcgccatgt tagatggcct ttggtgttcc gattcattaa aggcgctatc catgcggcca 
12301 ttctgctgcc cgtcttcctg cacgccgcgt tcacagcctt tgtggtctat ctcgatgtct 
12361 atgtatttga tacggttgga ctaccgagca gtattgtagg tctcgctcac atggcatgcc 
12421 gttgtgcttc ccaatctgat tatgcaaccg tagatcccat ccttatctat agtcgtgggt 
12481 ttgatgctcg ttttccgcaa ccagacctcc tacaaccgat tctgggatgg gcggaatgga 
12541 atgaacacga tctatacgtg cattcgaaat ttagtcagga ctatcgtcac caatggctac 
12601 agcaccgcgg gtcctcccac tgccgcagag aaagaggaca tcgagaggac gatacgtatc 
12661 ctcatgtcaa ttccctttgc agtcaagaat catcttcgcg ctgaatgggg agcagcctgg 
12721 gctctcggaa atgatgtagc tgagaacggt gtcgcagctt tcaaccccga ttatgcaagt 
12781 ctccttccag ctggactggt gggccacgaa gatgaggggc tgggattacc gtttcaacta 
12841 accttctttg tggatggatt tatcaagagg ggagtggagc gcggctggtt caatgctcct 
12901 ggggcaagtc aaatgcaggc tcagttgaac tctttgatgg atgcctatgg aaaaatggag 
12961 accatcaagc tgacccctat tccggttgcg catttgtaag ttccttgtct tgaaacgccc 
13021 ctttgactat ccacgccagc tcacatgcga aaggattcac caaaagcaag ttcttgctct 



13081 ctttggctgc gtcttaccct tcggtatggt cgacgatatg gggtggtgga ccgttcccat 

13141 ggtcagcctg gtcatcttca ccttgtatgg catcgaaggc atcggctcgc aattggaaga 

13201 ccccttcggg tacgaccgca atgacatcaa gatggatgcc attgtcggcg acgctaagac 

13261 tgaaattgac gttgtactgg ctgaatggcg cagactgatg gcgtcgctcg agtcagcgac 

13321 cgataatggc gatcagcatc tcggcaacgg tctggttgtg gaggaggagg ttaaggagcc 

13381 gaaattcatg ccgcctgata tgttcctcaa gcatcgtgcg agaccatcag ggcgataagt 

13441 agactagcta gctatttctg attgtgatgg ttttcagaaa ctgggcgttg tggagatcca 

13501 gcaaatctag cgtcgctaat aggggactat caccttaaga ctatctcggt tactagagat 

13561 cgacgcccct atcagcctgt caagcatgcg agtgattatc taggaggata tttccttcag 

13621 tgcgatcgtc aggggatatc tccatcgagt aactcggatc attgctggac caatccgacg 

13681 tcgtcgagat atcaagtcta tagtttcttt ttttgtatcc cgaatagctc cacttccttt 

13741 gcataccctc aatatcatga acataaacca tataaaagcc accatgatac ccctcgaata 

13801 cctaacaaca aagccatcaa tcaaatctac caactcaaaa atttccacat cactttaaac 

13861 tagaaacctc actccgccaa aatgaacctc ctccccctcc tcctcaccct caccaccttc

13921 ctcctcccca tcgcaacagc cttcacctcc accttcgaaa actgccaaaa cctcaaccgt 

13981 ctcaaccaaa ccgtccaccg cgtccaccct ctcctcgaaa aattctcctc catccacgcc 

14041 tcccaatccc aatcatcgac caccacaacc accacccttg ctaacggaac caccatcacc 

14101 gtcaccaaca ccacctcctc cgccgcaatt gagaagatcg agggcgtcaa ggctctcctt 

14161 ctctcttccc aggagaaaat ctacggcagg ttgagcaact gctcgagtga tgcggtgtcg 

14221 ccccggaatg tgctggagaa gggcgtgaag agagatgacg acgacgacgg ttcttgcact 

14281 ttgaacgatg ttctggatca gttggtggat acccttgagt gtgtgcttag ctttgcgacg 

14341 ggattgttgg agaccattct cgatgggctt tttgatctct tgaagactat tattgagggg 

14401 gtggagaagt tgttgtaggg caattctcct gttcttctga ataggaggga ttggagagtt 

14461 gggggtggat aaaatatatg tatgattatt taattaattc gtgatatcct atacttgagc 

14521 ttcgaagcat attttggcaa tgatctattt tcttgatcat gtcatgctga atttagccat 

14581 agagctacaa aataaatacc catccatgat atttcgcaca atccatcaag tacgttgggg 

14641 gcaagcacat ggatatcact cttaactcgc aatcatatcc ctcggcctat cgtacactct 

14701 gggagtttcg tctttgcttc atgtgccgtg ttgaactcat agctagaatt attgatgtcg 

14761 atgaccatat tggtgcagta tcctcggacg tcaatagcta gaatgagaac tcgtttaatt 

14821 tgttcagagt atgtggatat ttctttataa gggttgtatc ggattatata catgatggtg 

14881 taaatcacaa tggcgcttct aatgtaagca ccccgtccaa tattgaatcc ggattcgcat 

14941 actctagcac ctgcgataca gcgtgcaagt cactcccata ttgtaagctt ctatcctcct 

15001 caggcttggc cacgatgatc cgatgccgga agacttttct tgctgctagt gctacgattg

15061 agggcgtgag gaagtcgatg ccgtgtaaca cggcgaggag cctatgctgt ttatcagcgc 

15121 tctcctgcga catgcaggat atgatttgag agactcactt tgaaaatgac ttgaagtaga 

15181 ggtttgaccg ggtcgatatg cctccagcta ctgcacgact gagtcgaagg aacacaacta 

15241 tgtcttgttg gtaccgggct atgtcggcac ccatgctgac cgtggtgctt agttgttgta 

15301 gttgtccgag aagctgattg ttattttctt gttagtggaa tttggctggg tagaaaaggg 

15361 gagacaaact gtgctgtcta cgtaaggatg gcctttcttc atttgagctt ctggtttgtg 

15421 gataacggat gatgcggaca actggccatc ggatagccag tcgttgtttt cttctagata 

15481 gacgtagccg tcttctgtgt cgtggaaatg agagatgaat agatagtcgt tctatatttg 

15541 gtcaattata ttgggtgata ggctgaaccc tgtggacaac atactagatg agggtttatt 

15601 ggatgggttt cgtcgcgcac tatgagaggt aggaataaga aattgtttgg ggcttccaaa 

15661 actcctgccg gtgtggtgag ttgctttgag tgcattaact atatgaacca gtcaatatca 

15721 catatatcac ttggatgtcc atgtgtttac ctcgagagct tcgcgctgaa tattgtcgtt 

15781 tacatggtta aagtttttgg ctatcacaac attgacgact tcctgcttgt gttcgatagg 

15841 ggctgacttg aggcgtcctt tatgatctcg taagtcatga taagatgata tcatgcccgc 

15901 cttattatat caatttagtt atacaatcaa gcaaagtcca gtaaattgac atacagactc 

15961 agtagtcagt cgtgagcgcg aaggtcgagc ataactgctc ctgtagactc cgggaggaag 

16021 taattcagcg cgaaagtcat ctagagatgt cgcgctagag cagtccagga tgctataaga 

16081 gagaccaaat gtatttgtgc atatctagac cagtgaggaa ttagcaatct tgtagacaat 

16141 atcacgggat cctgagttac cagggcgagc tcctttgcga cgtcatggat attggcgtcg 

16201 gttgtgtcaa tgcggcagtg ctggtgtgcc gcaagacaaa gaaagagagc tacttcaaga 

16261 tcagataatt ccggggcgag tcttgagatt tccgagccat cctccatgat agtactttct 

16321 tgcatttgac tgtcatagtg gcagctggga tagtccatga ataattaccc gcgcgataca 

16381 tggtccggcc acctcaggca ctatatccca atgcggttag tgccggagac ccgtccgcat 

16441 gtgagctcca ggagcttccc gatagacaac aacatccggc tggagggtct tatcccagca 

16501 ctatctccat tggatctgta tcgctttcca tctatacatg accttcctcg gtgtcaatgg 

16561 cgccgtcaaa ccctctggcc aactgggaga gactaggaga tagcttctac cgcaaagtcc 

16621 ccatatacga tgccatcttc gacgatgatg tcgaattaga gaactacatc atcgctggtg 

16681 ctccctacgg aggggctatt ggtatgtgca atagctgtta caattacctg attccacatg 

16741 ctgatgtctc tgttcagcgc tgcaccggga tgagagcaaa ccgtacaggt tccgtgatgc 



16801 tcagaccgcc aagtcgagca ttgacatcta ctcatgctcg gggaagcata tcaaccgaat 

16861 caatgtcagt cagttctgcc ttcacgtaag cactttctaa catctgtagt gggagtacgg 

16921 cacgattcgt ggtctcggct ggtccgacaa ggaagaactc ctagtgatca cagaagacgg 

16981 caccgtcaga cgctatttcg gcctccacgg cgacttcacg tcgttttctt taggaaatgt 

17041 aacgctctca tccagttatt gctctgctga aattgacatt cacttaggga gctgaagact 

17101 acggtgtgag ggcttgccga ttctggacct cgggctttgt tgcactactt tcaaacaacc 

17161 aactggttgc agtgtcgaat tacgacgagc cgcggcctag gctcctagct ccttgtccag 

17221 aaggagaagt ctcatcgtgg tcactaatac ctccggctta tacgctgtct cgctcagttg 

17281 aagtcctgct tgcagtcgac aagacggtct acttggtcga ccccacggaa gccgaagaca 

17341 aagtccttca gaatgggcca tttaaacatg caagcgtgtc gccaactgga agatttgttg 

17401 cgctaatcac cgctgagggg aaggtatggg tagtcagcag tgacttccag agcaagtaca 

17461 gcgaatatga cccggaatcg cgcgtcactc ctcggacagt cgactggtgc ggagatgatg

17521 ccgtcgtcat cgcatgggaa gacgaggtcc atctcatagg gcctaatggc gttgcggcaa 

17581 gatattacta tgatgggact gtccacgtcg tgcctgagtt tgacggtgtg cgtctgatca 

17641 ccaacgacac ctgcgagttc ttgcacaaag ttgttggtaa gtaatccccg cttagtctag 

17701 gagcaaacct ctgatagcct gcaagatgtc actgaggcga tctttcgact cgggtccacg 

17761 tcgcctgcct cagtcctcct ggactcaata gatcttctgg agaagaagtc tcccaaagcc 

17821 gacgagaaca ttcaacgtat ccggccaagt ttgcccgagg cggttgacac atgtgtcaaa 

17881 gctgctggtc acgagtttga cacatattgg cagaagcgct tgttaaaggc agcctcgttc 

17941 gggaagtctg tccttgactt gtacaatagc gatgaatttg tcgaaatgac cgagaaactc 

18001 cgggtgctga aagcagtgag agattaccag atagggctgc cgctttccta cgagcaatat 

18061 atgcgcctta cgcctgaaag gctcattgag cgattggtaa acaggcatga ataccttctt 

18121 gcgatccgca tttctgaata cctcgagttg cctgccgata ggatatacgt tcattgggca 

18181 agccaaaagg tcaaagtctc gacggtggac gatgatgccg tctgcaagct catagtccag 

18241 cgattggaag gcaaaccggg catatccttt gagttgatcg cgcaaacagc ttacgacgaa 

18301 gggcgtgctc acttggcgac tcagcttctg aaccacgaac caagagctgg gaagcaagtg 

18361 cccctgctct tggacatgga agaagacgag attgccttgg ataaggctat cgagagtgga 

18421 gacgttgatc tagtgaacta cgtccttctg caccttaaga ctaagctacc cctcgccagt 

18481 ttcttcagaa cgataaatac acgtcctatg gcgtccgctc tggtggaaac gactgcccgg 

18541 ggacaggata ccgaactgtt gaaggacctc ttctaccaag acgaccgacc tattgatggc 

18601 tcaaacgtcc ttctctccga ggcgttggac gcaacggacc tgccacgcaa gactgaaaag 

18661 ctccagctcg catcgcgtct actgtcagac tcaaaagatc cttcggtggt gttacagcaa 

18721 aaactactca atgaagcatc ccagctgctg aaggtccagg aggccctcga caaggacctt

18781 gcagatcgtt cagagttcct tggcttgagc ttgaacgaga caatctacag actaatacga

18841 tcaggctacg gcaaaagagc acagaagata cagagcgaat tccggatgcc agagaagact 

18901 ttctggtggt tgagattaag ggccttggta gctaagcgcg actggggcga gctggaagaa 

18961 atcgggaaaa acaaaaagtc cccgataggg tgggaggtaa gccggccaac aaccccagca 

19021 tatgcgcccg catgctcacg gataaataac agccatttta taacgagatt ttgggagctg 

19081 gaaatacaaa gctcgcttcg ttattcgttc ccaaatgcac caatttgccg gctgaggaca 

19141 agatggaaat gtgggtgaaa tgtggcatga ttgcaaaagc cggcgaagaa gccttcaggg 

19201 caaagaatgt caacgccctt gagctcctcc aggcgagggc ttcaggacca gccgccgttg 

19261 agatcgaacg gatgatcaac cagcttagac cgcggaagta gatagctggc gcaaccatgg 

19321 cggtatttcc tctgcagggt agacgtccac gcatcaggtg atatcatcgt ccacgtcaat 

19381 gccttcgtgt tcctccacca caggagtctg agtgacaggg gccggcttcg agcgtttcgc 

19441 tggcttagaa gccgcagcat cgtccaccat ggcatcatcc atgttatcgt cagcctcctc 

19501 gttgagggct tcgaacattg acttcgtttt acgattcaga tcctcccatt cagcctagac 

19561 actcatttcg tcagcattat tgaccaatca cgatggaaag cctggcgtcc tcaccctacg 

19621 agccttgacc gatttggcgc ggacctcact tttagtgacc ttcttctcca gctggtccat 

19681 cacggcctct gctctttcaa cacccttttg ttgcctcaag cgctgcgctc tagtcttagc 

19741 tttgcctttt gactgtttct tcgagacgcc tgcattagcg cgttccgata ggatcgaatc 

19801 ccgctgtact gtagtttctt cggcccgcgg aagagatgtc aatgacttgt cgacgtcgag 

19861 actgggggaa gctgcccgcc tcgcagcgcg ggaatgtttg gactgggcta gacaggggta 

19921 ttatgttagc attgaacgtg catttagaag cagacttcct gtaagaatag ctgtactcac 

19981 gccttgattt cgccatgatt gatgcagggg aaatttgatt tagtgttttc ctgcatgggc 

20041 gcaggtgaat aatttgttgt ggagccgggc attatcgata aaacctttcc gggtggcgga 

20101 agacacctgg agcgcactcg gccgctgctt attctgcgtc gcaacactgg ctggtcaaag 

20161 tcgatccagt cgtcacttcg ccagacgtgg tcgtcgcctc ttaactgctc tgctaagtga 

20221 tgtcaatgtc ctcgtcccta tgccgtggac tttgcctcct tttcatcagc catgaatgct 

20281 cgcagagagg ggattgactt cctcactccg aatcccttca acgtgcggca cagcaactat 

20341 gaatcaatac tgcagcaact gcagggcaac ccatccttag ttgcctctat tttagtgact 

20401 gtcatcactt tgtatttagc cgtcagctat ttgggcccag cttcgttgtc ggtggctcgg 

20461 atagcctgga atatacttgt ctacttgaca ccttcgcgat tgattgccgc cctagattcc 



20521 aagacaccga aatccgacga gaaccctctt tcgttgacct tcgaggccaa gagtgaagcg 

20581 atgcaacgta tacttggcct tgacgatacc tccttctcct ccctcttccc tcgcgctccg 

20641 gctttctcga gcttcgggac ctccctcttg agcagcaaaa acaacgtacc gccagggctt 

20701 ggaaactggg ataattcgtg ctaccagaat agcataatac aaggactggc gtcccttcag

20761 tcgctggaga gattcttgga tcaaaacgtt gagcagctag gacaaaaagc tcttttgtcc

20821 acgcatcaag ctttgaagga tatcatagag cgactcaaca gtgccgacag caatggccaa 

20881 cgtctatgga ctccggcaga tctcaaatct atgagcagct ggcaacagca ggatgcacag 

20941 gaatatttct cgaagatcgt tgatcagttg gatctggaag tccaacaggc gactcggcga 

21001 cacacacgca acttgggcct gaagatggca ggaccgcagg aacatgtgat tggatctggt 

21061 atatctcagg agctacagga aagtagtgca ggggagacac ggataacagg aaatcagatc 

21121 tttcggaatc cgttggaagg tcttcttgcg cagagagtgg gatgcattca gtgcggttgg 

21181 accgaaggtc tctcactaat tccttttaac tgcttgactg tgccgcttgg cccgaaattt 

21241 gagtacgata ttcgagagtg tctccatcat tacatgcact tggaacccat tgagggggtg 

21301 gaatgtgcaa aatgtacatt gttacgtgta cagtctcagt tactgaactt gctgaagcaa 

21361 atcggcgatg atgaggagac tccgagtgct actccggagt caccaaagat atcagatgcg 

21421 cttcgaagct cggcacagga gcgcctacag gccgttgagc aagcgttgga agaggaggat 

21481 tttgcagaga aaacgctctc ccaaaagtgc catattcccg gaaagaatcg agtgtcgtcg 

21541 accaaatcca gacaggccgt ggttgccagg cctccacaat gcctagtcat tcatgtcaat 

21601 cgaagcatgt tcgacgaaaa tacaggaatg ctccggaaga actacgcagc cgtcaaattt 

21661 ccaaacgccc tcgacttgaa tccgtggtgc ttgggcgggg tctccaagaa ccagagtgaa 

21721 ccaagtctag aagcttggga aaccaatccc agcgtatcaa tgctctcgca tgcaggaaga 

21781 ggcgcgaatg caggtggtca ctatcagttg cgtgctgtga tcacccacta cggccgtcat 

21841 gagaatggcc attacatatg ttacagaaaa tatccaaccg acacttttcc ggcccacgtc 

21901 ccggacgcga ttatcgaggc agatggagac aaggagagag acgagcgttg gtacaggcta 

21961 agcgacgagg acgtgcagat ggtcagcgaa gcgaacgtca tgtctcaggg cggcgccttc 

22021 atgctcttct acgaggcggt tgaggactac tccccggagg ccgcagagga tgtggattct 

22081 gggttggcgg aagaggaggg ttcagcacct tcttctagct gcacgccatc ggagaccatg 

22141 tccacaacat ctgctgccac aggagactcc acgtccgatg tatcgcaagc gaccagcgtc 

22201 tcaacggcgt taaatgttga acctcgcgtt gagaaatccc tgcttatgtc cgacgttgac 

22261 tgagccagcg ccgtctcgat tgcccaacct tactctgccc gattgtgtat atcagcatta 

22321 tatgataccc cgagtacgac tctttagaag cgctcatcca taactttttg actgtctgct 

22381 ttacgttttg ttttggagta accgacgaga cgttgaacac ggctttgtat aacactgtat 

22441 gtacttcata tctgatcgcc gctttatagc taaagtacat tgtttttttg acactccata 

22501 accgcacccg tataatccca agcatgccgt atgccccgac tccctgttcc agattgtaag 

22561 taaagcccca gtaacagcac aaacatatac ccaacgcccg ctgaaatatc gaacaaccca 

22621 ccctatatcc atcaaacgaa ccaaggatga atcagaatac accagcccta gtcataacat 

22681 catgaaactg ctccaacgta accccagcat gtacccccgc accaccattc gcctccaaaa 

22741 tcatatcttt caacagctca tcccccagct cctcctcctt caactcgcga gcaatcctcc 

22801 tcagatgatt caacgtgatc acccccccag accccctcgt aaaaagtcga tacgccgcgt 

22861 ccacctccgc cgccatcgca tcatcagatc tagatcgtag cttcgccgcc gcgacagcca 

22921 gaaacggctc gtatggcaca taccccgtat ccgtggggtc gacggcctca atgatggagt 

22981 ggagttcttc gctgtcggtg gggtcgaggc cgagggctct attgtttcac tgtgttaggg 

23041 attccgttgc cgcgatacga atgcgcaggg aggggatacc gtacactaaa gctttgcgga 

23101 catcaccgcg gggaatcacg ccttcctttt gatctgcgaa ttcttcgacg gtctcggaaa 

23161 agaggtggaa aacttcttta atttcgtttt cttccgatgc gctgatgtcg ttttctttgg 

23221 cgagtttgga ttggcgtgcg cgtttgggcg cagggccgct gcttggggcg ctgcgctttt 

23281 tggggggctg gttgtggtga tgactgttag cgcattggtt gagggagagt agatagatgc 

23341 gataagcacg aaggtgactt accatggcag cgattggctt ggtatgctct ttgttagtgt

23401 gtaggtagag gggaagcctc gtcgcaaact gcgatgatgt caagggaaag ggtggttgga 

23461 gatgaggcgc gggggaggtt gagccatgga cctttggtag gctgggaagg atcgggattt 

23521 gccgattcac attgcactgc ggttcggagt caggatgggg atgccaatga ctccgaggaa 

23581 gatgaggtga gaagaagatg gtttgcttcc attgacttcg attgaagtga atctcgtgat 

23641 ggcggcgggc tgctgacgta accccgcgat gcctacgagg atctttctgg tacttcagct 

23701 gcgtttgcgc cggcgaggat tgcatttccc agacatcttg gagttgcgag tcggtctttt 

23761 tgtgtggaat ctacttcggt gctctttgat actccgcagc tttagttcga agacgccccg 

23821 tctcagccat cgtcatggat gtcgagctgt atgtctatga cctatccaag gtgaattcta 

23881 tagcttcagg tgtacatgtt cagtcacgga gtcatgggct aatgcggaac agggtcttgc 

23941 gcgaatggta agacacgcgt tgaatgtggc agagtcgcct cgtgtctaac tccatctagt 

24001 attcgctggc cctgacgggt acgcaaatcg atgccatcta ccacacatca ctggtgctga 

24061 acggcgtcga gtattacttt ggccaaggca ttcaaacggc catccccggc agtactcatc 

24121 atggacagcc catggagaag ctgcatcttg gcaagaccga gcttcccctg gacgtgatcg 

24181 aagaatacat ccagtcgctt gcggaaatct atacccctga ggtaggagct cgtaacccca 



24241 gcccggtcat ctctctaacc cggccatcca gtcctacgat ctcttcctgc acaactgcaa 

24301 caacttcacg caggatctgg ccatgtttgc tctcggcaaa ggcatccctg agcacattca

24361 aaacctgccg cagactttcc tgagcacccc ttttgggcaa atgatgaagc cccagatcga 

24421 aatggccctg cgcggtgtaa cccaaggcac tggcgctggc acgggcacag tcggcactca 

24481 aacgcccacc accagcgcac caaccgcccc tgcgcaacct gcccccgtta ctcaaggttc 

24541 tgtgcgcatc gcaagcaacc tcgcccagct cgagcaccac ctagctgccg cggccgactc 

24601 atgcgccgtc atcttcttca cctcggccac ctgtccgccc tgcaaaatgg tctacccgac 

24661 gtacgacgaa ctcgccgagg aagcgggcgc caaagccacc ctcatcaaag tcgacatcag 

24721 taccgccatg gacgtcagca tgaaatacag cgtccgagca accccaacct tcatgacctt 

24781 cctaaagggc cagaagctcg atgaatggtc cggcgccaac ccagctcaac tacgaggcaa 

24841 cgtccgtctc ctcctggaaa tggcccatcc tccccatcgt caccagcaac tccgtctccc 

24901 cagcctccaa cgccccatca ccaactatgt cacatacaag aaagtgcccc ctctcgacaa 

24961 gctcgtccag aaactcgacc ctcaccacga agaccctcgc ctcctctcta tgatcaccta 

25021 cctgaaacac cgcacctcct cttccacccc ggcagcggac acccccttac cccaagacct 

25081 cccatccttc gccacctacc tgcaaaccac ctgcggtttt ctagccctag accacctatt 

25141 cgctctcgtc gacctcaccc gcctgctctt cctcgacccc cgcgtatccg gctacttcgc 

25201 cgaagagccc ggccacacga ccctcctcac cttgttatcc ccgtctgctg gcctctctgg 

25261 atgtccctac aacctccgca tcgtcatgct ccaactctgc tgtactctct tctccacccc 

25321 tctctaccgc gaccaactta ccacctcctc ttctctcctc ccaaccctcc tccacctcac 

25381 cacctcctcc ctcctcgact cccacactaa cctccgcgtc gtcgccgcct ccctcgccta 

25441 caacctcgcc gcactcaacc acaacgcccg cttcgccggc catgccgacc ctctgtccga 

25501 ggaaagccag gtcgagctga ccgcgtcgct cgtggaagcc atcacccagg aagaagagag 

25561 ccaggaggcc ctgcacggtc ttctcttcgc cttgggattg ctcgtctacg aagctagtcc 

25621 tgacagcgca gtcgtggatc tctgcaaggc catggggatt gcagagacgg tagccgcgaa 

25681 gaggaatgtg gaaaaggttg cgaaggagcc gttgatcaag gaggtggggg aggagttgtt 

25741 gatgaaaggg ttgtagatta gtagatttag ataggaatac tccttttctt cttgattatt 

25801 atcagataca gatggcttag atgggttata tggatgttcc attaatatat gatgtacttt 

25861 attctgtgac tggattgtct gactcctcaa ctatcctacc ggtgtcctaa ttcatagatt 

25921 cccccactaa caaaaggata tccctcatcc catcccaccc atcctatccc atcatctccc 

25981 ataacattcc aaatctctaa ccagctgaaa caaaccccct caatcaatca atagttgcct 

26041 aacatcagca ccaccaccaa atgaggggtt atgttatagt aatcaatagt cccatctaca

26101 gtgtctcgtg atgatgatga caccttcagc acagatcaat aatcgaccat agtcataagt 

26161 accttacata aaaagaaaaa agccaatcta catggtagta acgtgagtag gaggcatgtg 

26221 atggctggct ggctggctgg ctggttgaat caaaaggaat aataatgctg ctactgctac 

26281 tgtcactact tactgcatgt gctgtttgct tttatctacc taccggaatt atgtgggcag 

26341 ttatctggtt gttacagggg tgcactaatt actagttact agtgtctctc tactaaggtg 

26401 gaactaatgg gatttgcgtc gggtatatcc gtgcatgtac ttgtgtcaga tcgtatgaag 

26461 tagttacagg taggatagta atgtagtagt aggaggatac attattgtat gattgaccac 

26521 ttcatttttc ggtggttatt attcattcat tcagtcattc atcattcaca tggtgaatgt 

26581 aaataggtat taccatacta attaggaaaa tggaaagaca tcaagtagtg tgaaacaaag 

26641 ccatcgctgc atgaattaat gggtatgtcg ctgagctatg caaaatgcaa tccatctaaa 

26701 gtaacgcggg agctgttctg tgtaacacgc tcggggtatg agaaatgaag gttggaaacg

26761 aaacaaggaa gcgccaggaa acaacatgat gacatacaag acggggtatc tgaacatccg 

26821 acgctgaaaa caaaaagaaa agaacaaaag aaaggggcaa taattagatc ggaaatcgcg

26881 agtaagaaaa agaaaaaagc aaaaccactg acgaaaaacc cagctgagag gggcatctgg 

26941 gtctggtatc ccgaaatgtt gctaaatcca tgaccaccaa aaccgacgcg acaacgcatg 

27001 aaacggaaca ataattaagc aagttaaaga aatccggttg acaagagggg aaagaaaaga 

27061 aaccggagag acaaagaacc acaagcaagg aaagacacaa tgcgcaacga gaagccgagt

27121 atccaagcgc gtgccatgac cctttgggcc agcatgatcg gggtgagtca ccatccgtag 

27181 gtaagggaag ggaatgcgat cagagcgtgc cctgacaatg gcggtgaggt tgtgaagaat 

27241 gataggcagc agacagagcc ctgtccagcg tgcatcagca gcagccagag aggggaaatg

27301 tccccccaat ttccatcaaa tgcgccggct gcgacagaga aattaggtaa tcaggcagcc

27361 aagctcgggc accaaatcag aggccgcgaa ggcgcaactc cggcctgcca attccgcccg 

27421 gctacagcgc ctaccaacta tctctcctcg cttcgtctcc cattccttcg tcgcacgtgc 

27481 accctctgtc gtcatattcc cccggtccca tgcccgatca tgctaattca tggccctgtt 

27541 cttgaaagtc cgagccttgt gatgtcccag actcccaagc atcagtgcca gaaatgtgtc 

27601 attgtggcgg ggggtccatg actgattgct tctgtcggca accactcagg tcgggggtct 

27661 tcccaaaaag gggggataag atgtggtagt tggaaaaaaa gaaaaaaaaa ttggtggaaa 

27721 aaggaagcaa ccagatagac gtgttggtcg catttctgga cagctgagct ggaaacagat 

27781 cggcgtcgat cattcgacag gtctaatctc cggctaaatc ggaccaaaat gttggcgtgg 

27841 ggtgacgaag tcgtggggta tcatgacatg cgcaagtaga tggtttggcg tattggtgtt 

27901 acaacgaaga aaaaagtaat gagaagatga cacccttttc ggtgtctctt aggcgcggca 



27961 gacggtggct ttggaagcag atgcggtccg ccgcttcctc tcggggatgg ccttgcgctt 

28021 cttgcgcttt gcgttctcgc gctctgctgc tgccatctgt gccacccgca tggcctgctg 

28081 tgctctcgca tgggcctgca cctgagcaag gaatcgcgcg cgctgcgctt cttcttccgc 

28141 tgcggcgcga atcatctgga tctgctgctc ggtcgtcagg tcaacgtctc ccatggtcgg 

28201 gtcctgcgcg gcggattctt cgtcgatcgc actctcgcag ggggatccag agccgatggg 

28261 gaggagatct tcatcggaaa ggtaggcgct ggcagtggaa ctgtccgagt ctgcactcag 

28321 cagcgaaggg cggttgggcc aagatgggaa ggcgcaagag ttgttcatct catagccgct 

28381 ggagaaggag cacttgcgag gagtgatgtc gagcaaggaa tcagtgggag agggtggtgg 

28441 gggaggcggc atcgaagtgg gtgagcaaga gctgccgaag ctaaagctga aggagacatt 

28501 ctgagtgcta gtggtggcct cgctcagaga agagcgaaca ctagactggt tctgactcca 

28561 aggaagaaag ggatacattg tgagaagagg tgttgttatc agtagtagta gtaagtggtg 

28621 gtggtaggtg tggtggtagt aatgggagtg aatctcggtg ttgtccagag cgatgcagga 

28681 agtataagta gtagtatgac tactactagc tctgtgacag taggttgatg agtggaggtg 

28741 aagaagatcg acacggtctg gggaggtcaa gacccaagag ttgagaaggg aggagtggga 
28801 ggtcagataa ataaataagc ttggagagga gattctgggg attgacgctg atcaaggaac 
28861 actccaccgg tggatcaaat ggcggagaga ggaaaatgag gtgaatgtat aaatggaaga 
28921 agggccagat cgatcggggg aggcgtctgg actcccttgc atctcccaac agataaggcg 
28981 tcgtggagaa cgaagaagga ggtgtggagt ccctgttaaa gtcggaggga ggggggttgg 
29041 cagcaaatcc gatttttcct cttcggatta atttgattat tgatttattc aattttctga 
29101 ttttcagaaa atcgaaaaat cgaaaatcga atttcttttt ttaaaagaaa ttcccttgtt 
29161 tgggtttgag ggtgtcgccc cactggggga attaccagtc accagcccga caaagcggcc 
29221 agaatcactt ggaattttcc gcaatgctgg cctgaaagca aagcatccac tggtaatcct 
29281 gcgtgtcaag cgcggcgatc aggcggaagg ggcgttccgg tcgctgtacg atggatgaag 
29341 cagcaatcca gttggatgga cacgagagag tactaccact gtagtagtag cggtagtggc 

29401 aggtaaacaa gtagacgtat gtgtgtatgt ggaaaatgtg tggaggtatg tggatgatgt 
29461 cggtgatgtc gcggcggtat gtcaatcaca gtaaaagtac caagtggctt ggtatagtat 
29521 tgattactgt gatcagtagt aagtagtaat gatcagatgg tactattgtc aaaagtgata 
29581 gatgttcgag ttgatcagtc cagtgcgatg tagcagcgaa ctcctgaaat ccatcatcgg 
29641 accagggact aaaggtacat atttatctat tattatgatt tattcccatc gggtggattc 
29701 gaccctaatc tgacagccag acccccggtc cagggcgtgg atctatcggc tttcttttgt 
29761 tcttccattt taatttaatt cctgggcgga tttgctcgct cctccggtcc gcggcccggg 
29821 gagcttcgga tggggggcga gaaaagggaa aagggaaaac gaaaaggaaa agcgtgggga 
29881 tggatcaagt gtcaaagaat gagtcaactt aatccactgc tccctccgtt ttcttccaca 
29941 tctccgagct ccccatcagt ggcacctccg taattgattc tttggttcgt ccgaccccag 
30001 atttattggc atttagccaa gtgcaggata ataccagatt agagaatatt tgtagagaat 
30061 acaactgggt gccctgtagg ggctacattg agatacttgc aacaacaaca ccatagtatt 
30121 agtagtagta ttcctgtact ttgactatag tagtcgaagg ctgacatgat acgggtccgg 
30181 taaatattcc cccatacccg accgactgac tgacagtggg ggctcatcat tcatccatcc 
30241 tttggtgttg ttttgttttc cggcggctga cgatattaat ttaatcagaa acaacgcctc 
30301 aatgcgaatt actccgtctg cccccaattt tgattattat ttttcgggcg gatgtttcat 
30361 acttccaacc tggcaggcag gctgtacacc ataagataat caactaatgg attattcaca 
30421 accccgccct gttgtccacc acaaggacgg agtaagtaag tacagtagta gccgacagga 
30481 aatcaaaaat cgctttgatc cgcgatatcc tcagcacaaa gcgacaggcg gtgttgggtt 
30541 ctcgtgcatc tcgcaacggg atcttgcggt ctgtgtagtt taccaggttt tatcagccat 
30601 ctgtcggtca gtccaacgat tcattcaagc caatcccctc ccctccctgc atacataaac 
30661 agcatcatca tcatcatcat caccccccgt aattccgcgg ccaaatcagc acgctctgac 
30721 ctcaccgctt cagtcaattg tttggcatct tcgcagcaac taacaccaca gcatgcttcc 
30781 gatggatgtc gacatacatc cgcccgtctg tatgagtatg caagtatgta tatacatctg 
30841 gcgataacgg tagtgggcat gtagtagtat gtgggtaaaa gaaaaataaa aagtggttta 
30901 gttagtcagt taccaactcg gtccaaaagt gactaagatg gtttcaaatg cgtttaggct 
30961 ttttgcatct cgcgttggtt ttacaactca tcccattccc tccctcgagt ccccttcaac 
31021 tcccttctaa catggggttc gctggggggg tggtctggcg gtcaggcagg caaacacccg 
31081 ggcgggcgcc tcagaagtcg gctgagacgg acatggacat ggaattgttg gcagtgctga 
31141 attctcgagg caaacaagac agtcattggg tttccttttt tttttttttt ttttgcgaca 
31201 tctgattatt gttatcagct cgtgccgcga taccagacta agtggaaaag gcaaaaataa 
31261 gcaagcaaag aaaaggaaat aaagtaattc gatagagggt caacttactg tcacaagcaa 
31321 aagtaacgta ggtaatcggg tacctaatcc atgcgcggta tccgcggtgg atcccgggta 
31381 ttcccttatg ccctgctacc ttactgtact tcggctgtgt atgtccacat ctccatctta 
31441 gcccgatagg aaagctgcta tactctacaa tactcttcac aaatgataaa ctctacagtc 
31501 agtatcctct caaacaacag gcggcgcaca cttcttattt gtacatccaa tccatggaac 
31561 ttgtcttgca cggatccgac cccggatccg ctactgtcaa gtcggggggg ttgtgctgga 
31621 acgacccaga gtacatagac gtgtggttcc ccgttcaaca atccaccccc cgcgacccac 



31681 gcgtatctac ccagataact atccttcact acttttctac cccggacgtt gtgtaccacc 

31741 gccaagccct gttatccgtt ccgtaatcct cactccttcg gagccgcgta ggacttaaac 

31801 agttcgtaaa tgaaagaggc atgaatagag tcagctgtgc ctgtctgccc ccaatctgat 

31861 ctgacgggcc gcaccacccc caggttccat ccactgtctg agttgtcgcg gtctgccctg 

31921 ccctgccttg ctcggccctg ccccccgtac agccagtaag tttcagtcaa gttgggggag 

31981 gcgaagcagc aacggccccc cccaatcatc cctcgagagc atcggtagtt gtccgtttca 

32041 gcccgcgaac tagtcacgtc cagtcaccta aaactgaaaa tggccagact aggcaactct 

32101 tcccactttg tgaaagaaaa caacgacggc gggaatatac ttctttcgga taactacatt 

32161 ttccaaatcc aatactcctt taactacagt aggcaataga atgaattact ctttgtgtaa 

32221 gtctagccta gctgatgaag atcactgcta gtagtacttc tccgaccgtc gcaagtatcg 

32281 agaggcgaaa tgcacggtta atctgggcgg gtagttttta gccttgccgg gcgggatggt 

32341 gcctcggcat ctggggggct gttgatttat ccattcatcc ctactatctc ccttcctttg 

32401 ttcggtagtg gtagtggtag tctagttctg acaaggcaca cgcagggctg gctcgctggc 

32461 tgtgtgtcca gtgatcatct gagcgttttg ttgtgtgatg actgactgac tgactagtcg 

32521 tacagggaca gcgtgtttca gactttgggg ctggatggca ttgtgaatgc agtgattact 

32581 actggtacta tatttattgg acaacgaatg aatgaatgac tcgatgcgat gcataagtac 

32641 atacggagta catatgctgg catgcagtat gtgtttgaaa gatatatata ctatatatat 

32701 atatcaaaca agcggagtgt gctttatcaa ttctgggtcg cgatcaatca atcgatgcaa 

32761 tgcatacaac taaattcata cagtgtactt tgtactactt ggttcgttgg ggtggatctg 

32821 aacgaatgat ccaattctag acttctttgc atatgtaccc ctggaactag tcagtcgcgc 

32881 caaagaacct gatcccagct agtcactaag tcactaacaa cttatctttg tgaagtacgt 

32941 tggccgatgg tctgtctcag ttcagctcca ccacaaacaa gatgcagtgt gaaaggcatc 

33001 aaactgacac tgatcatccc agacaaagta cgagttcagt atcatcacga gcacaccaat 

33061 accatgtcca gaacggccca atgactacta ctccaatcta ataatgaaga tggatgtgat 

33121 tcatttcttt ctttctgaga tagtaaacgc tttgaccgac cgggcaccag ccatcacaag 

33181 ccatcacaag cacactattt atttgaccgt cttgaatggc ttcatcttgt ggttgctgct 

33241 gctgctgctg ctgctattac taccggagct gattggcgct atcaggtgga caatgaaacg 

33301 gcccagcggg actgaactga accgtgacaa aaaagcctgg cctgtcagtc agtcagttga 

33361 cgagtaacat ggtcccgacc gggtgattgg gagcaacaca cccgatgcgc ccagacctcg 

33421 gttgttaaag agggtttcct ttgaacctga ctgacgggta aatgaaaagg tactaacccg 

33481 tcgacggctc tagaccgtcg ggcctaattt gcccgagggc aaatgatcag atagaagtca 

33541 atcaatgata gtccacgatc agtcagttac tcagtcaata gtagttctgt tcatgcacaa 

33601 atgcttttct ttcacccgcg gattgcggaa cccgcctccg atgaatgatc gaaccgccgt 

33661 gctggattgg atggtctcca ccgccccagc gatcgtccgt tttgggacag agtaaataat 

33721 gagaagtaat tttccctttt gggacgggga tcgtctcttc ccaatccggt agctttctga 

33781 gctccccgcc aacccaaagt acatccgcat ttccccccct cgcggccctt taagactcat 

33841 caactgcact attttggaga acacgacgcg ggtagacccg cggaaccgcg gtacgagctg 

33901 ccggactagg acttggagag atcatgggga ggatgaggaa cttttgcccc cctttttgct 

33961 tttggtgaaa agtatcgcag gttcttcagg gtgcaccgtg agcggggcag ctccccccgg 

34021 ccccggtggc tgcagtagtt cgcttcgggt ggaagtcaag taggaagtac tgagtacctt 

34081 tcagctggtc ctgaagatac atccactcac tgtgaatgga tcatcatctt gattaaggga 

34141 ggaggatgac gacaatgacg actatctgtc catagctcca gctctctctg cacatgcgca 

34201 ttccactcag cttcagctgc acatttccta gattgtgtac agtatactta cgtatgcact 

34261 tgcatacata agttacattc ccgaatactt ctggtcaaga ggaccagaag tactccggat 

34321 tcccattctg gggtacacct gtacttttac ttattgactt actcttccgg gacttgctca 

34381 catttgcctg ttctccaccc tggcgcatcc gcgtatgcaa gcctaacatt ccagactgtc 

34441 cccgactttg ggtactgcac ttccgatcga gacgatccat ccaactgatt ggtgctgacc 

34501 ccccttgtcc atcgtcatcc catgatcccg gataccctcc tgcattttcc acccacccac 

34561 agctcttcaa tgcaaggcat gcttcccagc ttacgtggtg atgcctcggt gctacccctc 

34621 catgagcggc gttttgcact gttccctccc tgctggtgtc cttcccttcc accacttgcc 

34681 aatctacggt actgtgtctg cagggactgt caaagctccc aaacagcggt gattcaccgc 

34741 ttcccccgtt attggttctc atgatgccat gccatcggtt tcgttggtcg ggcgaaacat 

34801 tctactcttg tggtgtttgt tatcgatcta gcgctttggg ttggactacc ttaggcatct 

34861 aaagatgatg catccactca tggtggttcc ctcctagcca ggaattggga cttttgcgtc 

34921 gccgaccggg ggaaggggtc aaatatggat gaaacattgg aagagcacta gaagctagga 

34981 ctgaagatcc aggagcgcgg tcttggactc ctcacaacgc acaccctggc tgccccgcca 

35041 cggccttgcg cggagtttaa aggtaccact tgctgccccc ggagtatctg gggatggaga 

35101 aagcacatac cacaccgtcc ggggaggacc gcatttgctg caccacaatc tcgacgggga 

35161 acgggtcccg gagcacggat cgagaaataa ggtgtactcc tcgcagcgga tgtagcagac 

35221 ggatgcctgg atatggttcc gtctgcactg ggttcccggt ttggcaatta gagctaccca 

35281 gctgggggat ccacgaccct gcgacgggtc tttctgagca gtccggaaac aacttccccg 

35341 gttgatcctc gacctgtggc tcgcacgggg ggaagacgca tcatatgtgt ggcgtcgcat 



35401 ttggaatgtg tggtatattt atctggctac atacagagtg aggcgaggta tgtggcgagg 

35461 gtgagcgggg acaggtggct tgagttgaca tacacagtag tactacggtt ctgatctccg 

35521 gatagggccg tctccagaag tcaaagggtt gtattatcga catgacggcg cggagtccat 

35581 gcgctctgta gagtcggtca ccaagccttg actcgtttcg ttctattccc gccacctgcg 

35641 acataatcgg caactccgca gtttaaacaa gtttattgtt ggcgggcgtc gtcatttact 

35701 gttgtcaaca ccatagttat ccttttctcg catgtccatc tgaccacgac tatcaggctt 

35761 acgatagcgt tcaatatcgg catgcgctcc gttgttccat gcagcgaccg accgcttagg 

35821 ctcatacagg agtcgtccta cacgggcggg ggttctgtag acaaaggcca ttatacttat 

35881 tctgggttac gaaggtgagg ttggacggtg caacgtgcgg gcgggcagtg cttgcgctga 

35941 cgtttcactc tcgatcatct acacggacac gcggagacgc cgcggattgc agtccccaga 

36001 tggagaaggc cgtacgggtc agcaggtaat ccctcgcgtt acactgagta ggatgagtag 

36061 gattcgtatc attggagcct cattgcagag ccgcgttgaa tgcatcattc tctcactgat 

36121 tctgtttgct gagacctcgt aacgcttcag ccttgttgga atgatgctag ggtttcatcg 

36181 ggtatgatgt cgatggtggt ggtcggggat tattggggat gatgacgatg atgatgatag 

36241 acacgttgac atggatttat ccacatttac cccatgtttc tggcacgtcg cttgacaccg 

36301 atcatctttg acagctacta cagcagctgc tgctgctact acctcggggt ggcgaatccg 

36361 ccgccaaggg cggataggat atagtgactg tgactgtaat cttcatgttt actatctcgg 

36421 ttatatttac taattgtcca ccatacacag tgggataaat ttagccacat aatcagattc 

36481 gacattttta caccgtgaat ataccaagca gccacctcac ttggaattaa taatatcatg 

36541 cccctcgcac aagatccctc actaacagcc acaaactact caccgtccat cggccatgta 

36601 acccactcta aaataggatg gtgacacagc attttgtgtg tatatttcct agaatattct 

36661 taattcctca aatagccatt tccggctcga tcgcatagtc tactaactat ggcaaacaat 

36721 gaatgactca ctgatgaagg atactagata agcacactat acacatgtac atacatcgcc 

36781 ttatcgtaca accggacatc cccgtcatca acaaacatta ctgaaatcac aataataaca 

36841 tcccacatca cacgggaggg gtaaatagca atccagtaag tggtgaggta agtaagcccg 

36901 ttcgttctgg aacgtttcca cacacagggt ggattagaaa ttatcattac ccaacttact 

36961 gcttaacctg gataattata ataataatga taattattat tattggttta cttatttatc 

37021 cctgacactt gaaaacatat tttgatatta ttattattca gattgtctcg gatatgacac 

37081 cgctgcaaca ctgcgatggt gcaatggttg gttggttggt tgattagttg gttgggtacc 

37141 caagtaaata aggagtagat agtaggtagg taagtaagcg tggcgcctaa cgaacattcc 

37201 atcgttccat tatccatcca tctccatcta ctacccattc atgataagct atgaggtaca 

37261 tatccttgaa acaagtgggg tgtgtatgta ccctgtctgt ctgtctgtct gtctgaccgg 

37321 gtagatcggt taacacggga tttcatgaca catgttactt acttacttca agatcctgat 

37381 ggattatagg aatggatggt tggatgggtg gatgataaat attcaggaag cgcttgttgc 

37441 gccgcgccgg tcggtggttc ggcataatat atttttatct aaattacaag ttgttggttg 

37501 ttttatatgt gggatgatgt ccacgtgtat gtgggatagg gatatattcc cttaaccttc 

37561 attacgaagt atccatagta aggatgatat aacggaagat gtcggtacga cataataata 

37621 gtagtagtaa tagtagttca ctactaaggg aatggattaa taatgaagga tcaaagatga 

37681 ggaagaggat gaaagctaac tgacggacaa tatccgatga taaataatcc caccaattag 

37741 ataactacct atcaatcctc attcctccag cttatcttag ctaaggatat attcaccctt 

37801 ttttatcagg agaatagtac aaggatcccc tcgttccaga aggtttcctc atttcatcgg 

37861 ctaattgcat gtgagattta ctctgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 

37921 gtgtcacggc aaccgtttac cgtttatcct tcccttccct gttctttttc ccttccgttc 

37981 cgctactaac taccgggaga gggggccgtg aactggcggt gagaaaatga ttccttgttt 

38041 ggttgtactt ggtgtgatcg gcacgagggg agaggattgg ttactgatta ccctgtctaa 

38101 ggtttatctc tttttctttg tgtaccaagt agtaaataaa agtagtgtgg gtgtggggat 

38161 aaattcctat tcgagtggga tctggaaaat agcaggtttt gggagatata taccatagtg 

38221 ggatggcgtt gagttacgta gtactcagtt atcttcctca actaatcccc gcgcgcggcg 

38281 gatagaatat ccaccactat ttaggagatt atcttgatat ggtttcctta ctttagactt 

38341 tccgtggccg ttgaattaac caaaaaacag ggaaaaagaa aagaatgaat gacaagttag 

38401 ataccccacg ggtcggtgtg tgtagccggt caacggctgg ctacttacgt agttggcaac 

38461 tagaatggga attgatatcc ttacctgaac tacttactta ctagctactt ctgtatattg 

38521 tattatcctt gctggaggat ggggttgggg tttagcggtg aaagaagtac tactcagaat 

38581 tcattcctgt tgtgtttcgt tgttcttgtt tgctggtagt tggtggatgg ttgcatcgct 

38641 tattgaactc tttctgcttt tctttcttgt tttgtttctt gatgcggagg gacttatggt 

38701 tgaggtatgg gtctagatct agactcggtg ggggtggtgg gtgggtgggt ttgttctttt 

38761 gttctttggt tcttcgatgc tggtggcggc ggagcggtgt gcgattatct ttgtgtatga 

38821 ttccgggctt tggtggtgtt gcttgaaata gccttgtact tcgtagggag gtatataatt 

38881 atcggacttt gagacaacta gggtctttga aagagtttta ttgtatacta tagtgaaagc 

38941 tctgttccag cttagtttgg gtagttacta ttgggagatg tagctgtata ggtctcaagg 

39001 gaaatcaagg ccgttgcaac ggtatctgca ttctatatag catctgtcta cttaaaaggg 

39061 tatagagtgc attcagagag gttgggatac tgtcacaatg agacgggcat accttgaaca 



39121 ttacggtatg aatcaagtct catatagtac tgaaaggtct tcggattatg aagggagagt 

39181 actagattgg ggcatcagag aatctatatt ggtaaacaga ttaatatatt gagatgctgc 

39241 acaggcactg atccacgaag gtggagcatg tctagtaata gtaagatctc acttctcaac 

39301 tgtagtatgg agcttcaacg cgatatgcag ctccgtgcac atgtggatct gtcccgatat 

39361 ctatcatctg ctttgagaat agtagcaatg cgctagagta gagcctcaag catgactagt 

39421 gcaggtgcta gttcgttcat atctattggt ataaattagt ggattatgtt tggtttatcc 

39481 tctgtgcgga ctgtcctcgt caaaggcctt gaagtggtca gtcctgtcat gaatctgatt 

39541 tcaacatccc ttcctatcat aattacactg tctgcccgac actgccagca tgttgattgc 

39601 tttatgactg gcatggttga tttcgagcat atgcgctgta atgccaggta ccgcactcag 

39661 gagctttgat gcaggccaca tagtaggtta cttgccttag catcccattg gtttgagatc 

39721 tccgggctgc tgtagcgtac gactggtgtg actggcatgg ctggtagtca gttcctgcta 

39781 aggcaactat cttgaagcag aatggtaatc ttcgagcatg ccgatccatg ttttgagact 

39841 cgatatttgc tttgacgcag cctaatttca ctatccgttc atagtggtat tagtgtttga 

39901 gcaggcagct tttcagctgt ccagtattcc aaaatccttc aatattgacc gcggtgttat 

39961 cggagttatg ggcattcctg atggaacgga gttaaggctc gatatttccg aacggcgcag 

40021 tatatagtgt tgaagatggc tgtagttgca accactcata gagcaccctc agcctgcaga 

40081 aacatctatt agtctgtact tcactatggc tgctgagcaa aagctggtcc taatcactgg 

40141 tacttccccc accttaacaa ttagcttcat ctaacaactc caggggcaaa ccagggcatc 

40201 gggtttgaga cagccaagaa cctaattctc tccgacaatt accatgtcat cttaggctcg 

40261 cgagacccag ccaagggtga agaagccgcc aaaaccttgg aggccgttcc cggtatcaaa 

40321 ggcagcgtct cgtcgattca gatcgacgtg acagacgacc agtctgtgga taatgctgct 

40381 gcccagataa aagctcagta tggacggctt gatatcctgg tcaacaatgc tgcgatgagc 

40441 tcgatgaagc atccgccaag tcgggaggca atgcgccaga ttctggatgt aaatgtggtc 

40501 ggagcactca gcacgacaga agcatttctc gacctgcttc gaaactcctc tgagaagcgt 

40561 ctcgtatttg tctcttcgag cacaggctcc attagtcgtg cggcagaccc ctcatctccc 

40621 ttccacatag ccagtgccac tgagtatcga gccagcaagg ctgcactgaa tatgatgatg 

40681 gtcttgtaca tgtgccgtct caaggatgaa ggtttcaaag tgtttggagc agatccaggg 

40741 ttgtgtgcca cgaacctcac cggtgatcct gaatctcttc gacgacgaaa tgctgcagag 

40801 ccatctgatg gaggggagag ggtggcgacg gtggtgaaag gggagcgaga tgcggatgtt 

40861 ggaaaggtgt tgggagtata tggagtatct ccattctgaa tgtcctgata gaatttgatt 

40921 gtaccaccag tccgcttatc tcagtctgta taccactgta atcgagtatg ttgtatagat 

40981 tgattgcatg ccaaaacaat agatggcatc agactaaacc aatcatggca tgtgcctgaa 

41041 tgagtcatgg ccagtcacga gacccagccc aaagcgggga tgacggcggt gaagccaatc 
41101 ggtggcagac attccagaaa gtcgtataat ttgctgaaaa actggtgggt aaccatccag 
41161 acactttatg ccaccaccac gattacccct cgtttgccgg tcgctctttt cacgccggtt 
41221 ctctcgttcg gccatcatgg agagtcgcca acagcctccc tggcagcaac cgcctgctca 
41281 tcccgacact cgtctgccgc ccctcaaagt atggaactcc ctcaccagat ccaagacgcc 
41341 ctttgtgcca ttggacccca aaggccggaa agttaagtgg tacgcttgcg gtccgacagt 
41401 ctacgacgat gcgcatctgg gccacgccag gaactatgtc agtaccgata tcctgcgtcg 
41461 aattctcagg gattacttca agttcgacgt ggagtttgtc atgaacatca ccgatgtcga 
41521 tgacaaggta tgtgtcacta ccccgccggt gacaatagct ctcgagtgtt gactctggct 
41581 ccaagatcat cctgcgtggt agacaacagc acctgttcaa caagtacatc gccgagcacc 
41641 ccaccgttac gcccgacgtt ctcgaaaccg cccgcaaagc atacagcgcc tacatcaaga 
41701 agaaccttcc cctggtcgat cccgataccg agccggagaa ctttgtcgca gaagcccaga 
41761 aggtctatgg ggttgtgctt cagggagggg ctctggaggc agacaaaaag cccggtgatg 
41821 cggaggtcaa gatcaagatg cacatcaaga ccgtgacagc cacgacgtcg gcgattgcgg 
41881 aagccgctca gaagctcgcg ggcatcgatg ctgcatccag caaggctttt tacgatgccg 
41941 cccaggatgt cttctgcttc tatcttgatg tgacggaggg gtctacaatt cctggcgatg 
42001 cacatgagat cttcacgaaa ttgacgaaga agtacgaaga ccacttcatg agggatatgc 
42061 gggatctgaa cgtcctcgac cctgatgctg tgaccagagt cacggaatac ggccagcaga 
42121 ttgccgactt cgtcgaaaag atcgtcgcga acaagtttgg ctatgtcacc tccgatggct 
42181 ccgtctactt cgacatcaag gcattcgaag aagctggaaa ccactatgcg cgacttgagc 
42241 cttggaaccg caacaaccaa cctctgctaa gggacggtga gggtgcgctg tcgcgggcaa 
42301 ccgagaagaa gtcttcagat gattttgccc tctggaaggc atctcgtccg ggagagccca 
42361 gctggtcaag caagtggggt caaggacggc caggctggca catcgagtgc tcggcaatgg 
42421 catcgagctg ccttggtagt cagattgata tccactccgg tggcattgac cttgcattcc 
42481 ctcatcacga caacgagctt gcgcagagtg aagcctactg gtgcgagcat aagcagcagt 
42541 gggtcaacta cttcctgcac atgggccatc tgtccatcca gggttccaaa atgtcgaaat 
42601 ccttgaagaa cttcactacc gtcaaggacg ctctcgagcg cggtgactac actcctcgca 
42661 gcttgagaat cgtcttcttg cttggtggat ggcgtgatgg tgtcgagatc accgatgatc 
42721 tcatcaagaa tgcctcttcc tgggaagaga agctcaataa cttcttcgtc aaggcgaagg 
42781 acccctccag cttccgttct tccgatgagg cgccgacttc ttcatccgag accttatcac 



42841 aggccctcaa atctacacaa gaaaaggtcc atgaatactt ctgtgactcc tttgataccc 

42901 ccaaggtcat ggccgccatc tctgagttgg tcacgacgtt caacgccctt gacagtcaaa 

42961 ccttggatct caaggtggtt gaatccatgg gcacatgggt cacccagatt gtcactatct 

43021 ttggcttgaa tggcgctgcc tcttctgaca gctgtggcat cggctgggaa ggcactgaca 

43081 ttccggaagc agccaagcgt ttcctatacc cactgtccgc catgcgtgac actctgcgcc 

43141 aagctgccat gtccggtact cccatcaccg ccgagcaagt gaaggatatc gtcgccacct 

43201 ctgccaccaa ggaagcaact gtgtccgatt ctgacaagcc ctattcccaa gtcctggaca 

43261 atttcacttc tcaattggcc tcgctcagct tggacgatac caagcaggca tccaaggaca 

43321 tactttctct ctgcgatcgt ttgcgtaacg tcgatctctt taacctgggc atttacttgg 

43381 aagaccgcga aaacaagccc gcactcgtcc gtcctgtcac caaggatatg ctgcaggccc 

43441 gcgaggaaca agctcgtaaa gccttgctga agcagcaaga gaaagagaag caagagaagc 

43501 ttgcccagga acggctcgag aaaggaaagc ttaatcccgt ggaaatgttc cgcaccagcg 

43561 aatacagcgc ctgggacgaa gacggcattc ccacgaaaga tgcggcggga gagcctctcg 

43621 ccaagagtaa gagcaagaag ctccggaaag attgggagcg ccagaagaag gcccacgaag 

43681 cctggcttgc aagccagaat ggaaaatgat agattatatg aatcaatgaa taatacacca 

43741 tccgtcttgg ttgacccaga aggatcatca ttctaaatcc gacgtgcata atcatctgga 

43801 agtcatccat tcgtcacctt gatttaggaa ctcatcatga ggtaaccatg ccagtagtac 

43861 tacgcatcgg ttacatacag ccatgtccct ttcctctaat tctgcagttc tgtatcccct 

43921 gtgtcatatc catattcact gcgcattcct aaaatggctc ttgacaatgc tagacgtgtc 

43981 tacagtgagt accattcatg tcatgtacca gctttcgttc tcacaatcgt tggcaaccag 

44041 tgacgcgtag tgtcacggcc cagatttccc atgatgcaaa caagttttac caactcccaa 

44101 ttgaggtata ttatctgatc gtctcattcc tcggttatct tgaccttcgc cccttgcgtc 

44161 ttacctgccg aacgttttat cgagtgtgtg cagtccagct caacaatatc gaccttcata 

44221 ccatcactac ggacctttcc cggctccatg tgcagcggtt gatgaggctc accgcaaaac 

44281 aagcctcgga tatcagagaa ttgtcaatat gcagtgcttc tggaggacct ctgagtcttg 

44341 gttatcgcgt tctcccgtcg tgccccctga agtcacaagc ctcaaaagac ttgcgagaag 

44401 ccctcaaaaa actcaaacaa tgccgcactc tccgaatttc tgcagtggaa attaagcata 

44461 ctgactatga actagcatgg atgggggcta ttgacatgac tcactcagtg ctatccgcat 

44521 tctgtgagac tcaaaactcc ctgaaatcgc tcaccttaga tttcaggaca gataacgtag 

44581 cctcacttct acgtggtggt ctcgagagaa taccgtccta tggccccgct gagctgtctc 

44641 gtttcccatt tggaagttca agaattaacc acatatcact caatctcaac gcaaacatag 

44701 caggtagtcc tggcttgtgt gagagacttt tacatccgct atatacacgc tatggagtaa 

44761 atgaggtctc acttgacctt ggaaaagcct atgcagagaa tgtcgttctg gatgtcttga 

44821 gaggtaggta ttttaacaca ggagaacctg aggtgatcag attacgcaat gttacaggca 

44881 tgggccatcg catgcttgag ccaaagttaa agcagtattg ccattctttg cgggtcctga 

44941 ctatggagaa catcatcttc agatataact attggctgag atttttacac agcttaaagc 

45001 gtgaattccg ccgtcttcaa gaggtaaatc tgttttggga ggtgaacact agttcggaag 

45061 tgagtttgcc ccttcagttc ttcggtactc tcgctaagtg actctagagc acctcgctca 

45121 tcactcaatg gaatcacaat acaatctatg acaacactca aactagggtg acctacactg 

45181 gacctgaaga acacggtgcc cttactgctc ttgagaagat agccagtgaa gaccctctct 

45241 ccgttcatct gtgattccgc caggatgaag aataatctga tacatccgct ctgtctgctt 

45301 gtcatgatgc agtatagttt gtcaccgttc ttcaacactg ctcttgtgaa ccagcgaagc 

45361 tgctctcgta tggatggaag cctgcaattc catcccatag agtatgaaat gtacgacatt 

45421 gataaagtca tgtgtctatc agattcaaca ttatagtagg gcataggtga tatcaccaca 

45481 tggtatgctt gattgacgca cgcactctct taccacctct tcaacgctgc agtacctggc 

45541 cgatcaaatc gtggccaagc gattgtatac ctttaatgta gtgtatcctc cgctgaagcg 

45601 ctctatcaaa agagtgttat aaagtatatg ccgttctaca agtcaagctt tgaaatcgca 

45661 ccactatcaa agcgaagaac tgggcctgaa aatagagtca tcaagccgta cagtagaatt 

45721 atgtagaccg gaatctaaac tagcagagac actatacaga tatctgtatt agaaaaacat 

45781 gtgagtagac aagagttgtt tcaggagtga agcaacccat caaagctcac ctgaggagcg 

45841 gtggtggttc tactccggtc acctgctcct tggggaagat acttacttcc cctggatctg 

45901 cccaccaagc agctctgcct tgacaccgca tcacacaacc agtctccaaa agagtttcgc 

45961 agctttgcta tggtggaacc atacagaagt caaaactcgt ctttatactc ttccgtcttt 

46021 ctgcttgatg tgtttctttg ttgagttcct acataccagc atccccgaga attcgctgcg 

46081 gctacgcaac aaatatcaca ccactcctat aatcctctac catgagctca tttcccatca 

46141 tctgtcagca acagctgaaa cagaatacaa catggtgcgc tggtacacag gattattgcc 

46201 taacctcact caggggagat aaagttgcta gacttacagt aggagagact tcactacatc 

46261 ttgccggcga ttcctatatt gccatcaagc gaagtttcag agtgtcgtga acatccaaga 

46321 actggaaggc tcatgactgc accttcagac ctggaagtgc tgacattgtt atcgtaatgg 

46381 tggcgtagtg ccagcttcac taaaggacac cggccataca tcataccact tgattagtcc 

46441 cgtcgacttc ctacaactcg caagttccca tcccattctc cactattcca atatcggaca 

46501 cattataaca cttgattcgc tcatttcagc ctcctgcaat atgcatcaaa ggactcgttc 



46561 tccacttgaa agcctcaacc tctccctctc aagactgtca agggacaaga agatgcatgc 

46621 attccacttt cacctctcac aagaagcgaa gcgtcagaga ttgcaagaca agattctcgc 

46681 aacccacgct gggatcgaag aatactcggc tggatatcta aagccatggc aattactgaa 

46741 gttgaagatc gacgcgtctg aagtcaacat aagcagcaaa gatataatca agtttagacc 

46801 gtgctttttc tctcccttac ccactgacac tatacggacg tgcctatcga agcagccaaa 

46861 caagaaatct ccagatgaaa atagtttagg ctatgcagga cgtgcaatgc catgcccaag 

46921 cacagtcgct ctgatgcatt atccacttat ccttgatctc tttcgagaga aatatgcaac 

46981 ctccgtacgc aaaggtttct gcttggacag cctgaccact cttcgagagt gcgggtaagt 

47041 tcgtataaat cttccaaata tccatatgtt caactaacgt gcagaataca gagccactag 

47101 caagggcaag tggaagcagc aaccaggcct cgaactcgcc gagaagctat ctaccgagcg 

47161 ctggcaaatc ctgcgcattt tagaggatct ggagcatgaa gtaccccacg ccgtctgcga 

47221 tatccgtcag aacatccaag ttgacagaaa gaaaggtctc acggtagcgg aagtgaaagc 

47281 cattatgaga gttatggctg tccgcatagg gctggactgc taccggccgc atttcctaat 

47341 gccagtgagt catttgcaac ccttttgttc tcagtccggt caaatgtctg actgacgagt 

47401 cagatcctgg ctatatcata cttgaatcga aagcaaggac gaatcttaca agtacaccat 

47461 gatgggaaac gggtcgttat ccagtacacg aaagtgctag actttgatgg acttgaacat 

47521 caccctatcg agctcttcct tcgctactac tgcagccagc cagtcggaaa gaccgccaat 

47581 aagcagaagg attcatcgtc aagatagggc tagagaattg gctccactga cgccccacgg 

47641 tggtaaggaa gggcatcact tccaaaggtc aagaagaatc taggctctag attaccagcc 

47701 gtcttgcatt cacactatcg cggtttcgcc cccgccaaac ttggcatcgc aaatgcgata 

47761 acgctatcgg cttggctccc caatcggaag caatgcacgc cttatctaaa tcatcgactc 

47821 taaactgctc agaatatagc aaagcaaggg attccttaaa gaaatcaact gttcgacgtg 

47881 acagaaaatc atcacattag acgattcgca aattctgaag atgtctcgga ttcttcagag 

47941 gtcgccttac ctgtgtaaca ttacgattca cgaggccagc ttctccgcat tccgggggtt 

48001 tgattgcagg gtacctcggc cgttatcaag ataggcctgg ccgacatgct tcttatcggc 

48061 cttgttacag ctgcccagac gctaacgacg ctcaggtgct ccaagtcacc acgcgtggcg 

48121 ataagcactc aagctttctc ttgatcggcc tcctcttaaa aggaggggtt atggcctcct 

48181 gcgatgtctg cggtatcccg ccctcaatca ctgggcttct tggtcattgt gatactcgtc 

48241 taccgcggag atggctgcca aagagatgaa tgagaaatca ttcaaatccc cgcaagttag 

48301 cgcggatatc gagattcaac aaggcggatt cgacgaaata caaccggaat ttgacctttc 

48361 tgaacaagag aaggagcaaa atgactttga ctatgaatca gagcattcgc ctttcccaga 

48421 aggtatgcgg cctttcatat acacactgac ccctctaatg ggtgacacgc ctagtacgcg 

48481 cggtcgtccc tgagacggac gacccaggtc tgcccgtcaa cacattccgc atgtgggtgc 

48541 ttgggttcat cttcaccatt gttggatcgg gaatcaatca attcttctcg cttcgatatc 

48601 cgtcggtcca tattgtgtca ttggtagcag agctactggc ctatccgtgt ggcgtcttcc 

48661 tggctaaggt cttgcctcta tggactataa gccttggacg gcttggatcg ttcaccctca 

48721 acccagatcg ccatttcaat atcaaagagc atgctctcat agtcatcatg agcaacgtat 

48781 ctttcggcta tggaagcgca gattccacca atatcatcca ggcgtcgagc gccaggttct 

48841 acaatttcgg tctctcagcc ggcttttctg tcctcgtggt cctttgcgca caattgttgg 

48901 gatttggagt cgctggcctg gctgcaccgt ggctagttga gccagcaagg atcatctggc 

48961 cccaggtcct ttctaactgt gccatgctgg agacgctcca ttcacgcgca aacaccgtcg 

49021 ccaatggctg gaagatttcc cgtctgcggt tcttccttta cgtcactgca ggtggtttcg 

49081 tgtggtattt cttcccaggg ttgatgttca ctgccctctc gtactttacg tggatttgct 

49141 ggatcgctcc ccgcaatgtt gtcgtaaatc agttgttcgg aatgcagact ggcttaggac 

49201 taagtcctat cacattcgat tggagtcagg ttgcctacaa cacgaacccc ctgctgtcac 

49261 cttcatgggc cgcgatcaac gtctttgccg gattcgccct gttcttctgg attgtcgttc 

49321 caggaatcta ctactccaac acatggttca cagcctatct cccccttatg actgccgacg 

49381 tctatgaccg gaccggcacg gtctacgaca cagcccgtgt catcagcgcc gacaacaccc 

49441 tggacgtcga tgcctacaga caatattcgc ccccatactt accagctaca tacgcattcg 

49501 tctacggcct gtccttcgcc tcaatcaccg ccgttctaac ccacatcggc gtctggcacg 

49561 gcaaagaagt ctgggccgca ctaaaaggca aaaacaagct cgacattcat gcccgtctca 

49621 tgaaatccta caaaaagacc ccatggtact ggtacgccgc catcatcgcc atcatcaccg 

49681 ccatcgccat cgtcatggtc gaggtctacc acaccaagct ccccgtctac ggcgtcttcc 

49741 ttggcctcat catccccgcc atctacatgg tcccctgcgg catcatccaa ggcatcacca 

49801 acgtcgacgc caaccaactc aatgtcctag ccgaattcat tggcggctac atgttcgaag 

49861 gcaagcccct ggccaacatg atcttcaaga ttctcagcac cgacgtcgtc ggccaaggcg 

49921 tctacttcgc catggatatg aaactaggcc actacctcaa aatcccccct cgcaccctct 

49981 tcatggccca aggcctcgcc accatcctcg gcgcccttac ccaagccggc gtcaccatct 

50041 ggatgctcgg ccacatccaa gatatctgct cctccgacca atccgacggc ttcacctgtc 

50101 ccaacggccg aaccgtctac tcctcctccg tcatctgggg cctcgtgggc ccccgtcgtc 

50161 tctactccgt cggccgcatc tactcctccc tcctacactt cttctggatc ggcgccatcg 

50221 cccccctcat cacctacttc ctctacaagt acacccgcaa acaattctgg aaatacatca 



50281 actggccgct catcttcgtc ggaacatata atgttccccc cgccaccggc atcaattact 

50341 ccagctgggc cctcgtgaac ttcgccttta accacttcat caaaaagcgc ttcttcgctt 

50401 ggtggaagaa atacaactat attcttgctg ccgcgctgga cacgggtctt gcgcttagtg 

50461 gtatagttat cttcttctgt atttcgtacc ccggggctgt gtttcccgat tggtggggaa 

50521 atacggtcta tgtcaatacg gcggatgggg agggcgtcgc atataagtct atgcctgagg 

50581 tcgggtattt cgggccagcg aatgggacgt ggagttgatg tggttgtact agtcatgatt 

50641 gttttgttgc tcgtgggtgg gaattgtatg tatttatgaa ttgaagattg ccgtggtgat 

50701 cgctatggta tggaccatta ctatcaaata aatgattttt catccttaca tatcaagtga 

50761 tagagctgga ctgttagtaa ttgtggtaag tggttgatgt gttggttggg aaacaatttt 

50821 aggagaggtt cctgacatag tgatgcagtc atgtttatat gttactgtga tgttgtagca 

50881 gatgggcggt aatttatctc cttcagagag caaatatact caatatccga ttaacctgcc 

50941 caatacgtct gcaacaatcg cctactatta caatgaacac atgtgcttgc cgtttttgat 

51001 tgagcaagta tattagttac atggctttcc ccatcccacc aacgatatcc gagtagcact 

51061 gacaacatta tgagagaaaa aagaaaaaag gaaccgggca cactagaaag acatatccga 

51121 agagcaatga tgaaataacg aaagcttgaa actcgaatcc aaccggcaga cccagagtgc 

51181 gtaaagattg gccgagtggc cggtcaaagc ctggcccttc gcagtcgata caattgactc 

51241 acgcggtcgt gtagagtcaa acgtaagatg ggacatttac taaatagttt attagctctg 

51301 gttcgcacaa caactcgttg acaaaactta cggagatctc cgccattccg aggtcttcgc 

51361 cgccggtagc ggcaggctgc tggggggcag caccgtttgc ttgggcattg ccctccgcag 

51421 ggccagcatt ctcgttagtg gtagagaagt agtcaaccat ctcagcatca agctcctcca 

51481 cagtcttggg cttagggcgg ttaccacggc cgttagggcg acgagcacgg cccttgcggc 

51541 ccttggcacc agcagcagcc ttggtagcgg tagcgggctt cggttggggc ttggtttgtc 

51601 ttcatcgcgg ttagtcatga ttccccagta caagacaatg gaagaggaca cttacgcaac 

51661 gcgctcgccc agaggcttag gggcagagac ctcaggagcg tgggaggcgt cgacgacaac 

51721 ctcaatctag tatttgttag tgtccatacg taagaatagt gattggttag tgaaaagctc 

51781 accttcatgg gtctgccatc aacaaggagt ccattcaagt ccttagcagc cttggcagct 

51841 gtatcgggct tgctgaacac gatcgaggcg ataccacggc tcgtgccgtt ttggttgtag 

51901 gtgagcatga ctcgcttgac aggacctgct gactttgtga agtattccta taccaggaga 

51961 ccaaattagt atgtcgagcc aacagaatgc agaaggcgca aggaattccg aagagccaaa 

52021 acatcgaaga acgcgtgaca ccggcttcaa acgcgatacc aaagagcacc agcgagcaca 

52081 cagcggcggt tagtagtaat cagtgcatgg cggttcattc cagaacaccg agacactgtc 

52141 atatctagag cacagagctg ctagcattta tcccaagaac cacaacgatg aactccctca 

52201 aaagccttca cagtatcgtg tgtccccctt ccaactaacc cactcagcaa tcgtcatcga 

52261 ccttttcgca accagacaag gcgctagcct caaaagctac gggctgtgtg tgccagcaac 

52321 cttcgcccgg ggacagacaa cacgaagact acgtgtgcgg caaatccatg gttctgtttc 

52381 cctagaacct ctctagcagc ccatgatagg gagaacaaga cacgatttgt ctcacggagg 

52441 tgatatggga agatgcgtct ttcctgcccg acactgaaac atttcccata ccgaccgccg 

52501 gtagcatgat gcttgacagt tgatcacctg ccgaactagt cggtcactgc tggtaggctg 

52561 atgcgctcgc tgatgcccga cacctcgaaa cgtgggactt tgtacgggtg tagccaatcg 

52621 gattggtttc gaagatggta tgtttcaaca atcatctgtg gcacagtctg ctgcgtggct 

52681 tcaaaggcgg atggctggaa gatccgcccg agctaacacg cctttcgcaa gactacggca 

52741 gaactccaac ttcggcggac tcgcagggtg ccagcgtgca attaccgcat gcgaagattt 

52801 caatggagtt tgctgcctat gctgaagacc aagaatccat gacggtgata taaaatatat 

52861 atagataacg tggcaaggat cgaaacgcgc gcgaaagccg ttgcgcctcg aaggtttcga 

52921 gaagaaaaca tggcgataga aacccaaggc caaggccttt cgatcaacat accttgatat 

52981 tggcctcatt cacatcggaa ggctaaaaac gttagtaacg gttcatttat caatagagca 

53041 gagataacgg gtactcacca agccgctgac catgattttg ctctccgtag agaccgggtg 

53101 tccattctga acagcctttc cgggcgcctt ggcagccttg gtggtcttct tcacacctcc 

53161 gactgggaca ggagcgttgg cagccttgtt gggagccgag cggcgagcgc gacggcgagc 

53221 gccctggcga cggctgacga ggatctcgtc gagagacttg tccaacttgg cagacatagt 

53281 gagaagcgga gtatagaacg tggaggagtt gggtgtcggt agttgatgaa ggtttcacga 

53341 gagtgttgag tgttacaggt gttgcgccac ggagaatcgc gacgctggaa cggggttgtt 

53401 tctggaggta acggtgtcca gaagctggca aattaatctg gacgacgggc aggtagcaaa 

53461 aatgtcgtgc tagatgtaga ttgcagcgtg tggagataaa agctgcaacg tgggttgatg 

53521 gaaatgttgg gaatagagag gatggatgga ggatgaaagg aagggaaacg gaagagggat 

53581 ggagaagata agcaaaaggg aaaggcggga ggttcagttc cctctcagtt cgtcagccgg 

53641 atcttcttcc caaaaaggct gagccaccct gcccaccgcg ctagtcccca tcgggcccat 

53701 cgggccaaca ctgcaaacaa cacagcctgg ctcctttcag gccggtgccc aagcagatca 

53761 tggcttcaac tacagtccag cttacttcaa gctaatgtaa caacgctacc tagctatgca 

53821 atctaatctc gtatattcat ttcgacacat gagtccagac tcaataagct cgcggtgatc 

53881 tgtaaccttc cgctgcgcca agagtccctg ctgcccaaag actcaaccga agcaatcaag 

53941 cccagtgcca gcccaaaatt gcgtagaagg aaacatgatc atgtatattt acgagcaggg 



54001 atattatcgg ggaaattctt caaatcaaca gaaacagctc aatccgctca aggtttccag 

54061 ttaaggccag tcccgtagtc aacattagga cgaaatgagc tcattcgata cttaagatca 

54121 tcccaaagac tcccgccatc ctacccaaac gccgttcata acatgaaagg aaagcaaccc 

54181 atattcaacg gcatattgtg actcattatc cattgtcaca taggcccgaa gatgcagatg 

54241 cacgcaaacg tcttcattgt tcctcagctc ttcaaggctt cgagccgtcc tcgcatctgc 

54301 tctagtgtcg cttcttgatc cgcgaactcg tcctctggct ccggagcctc ttccaatggc 

54361 ttctcgggct ggaccccttc gacctgagac aacttgccat ggaggatctc ctgcaggacc 

54421 ttgtcgacct cctcctctgc ttcctcttcc tcttcctcca gtaactcgtt gttaggaatg 

54481 gcgtcatcga ccatctcctc gatgatcccg gcccgcacca gttccgtcga aagctgacgc 

54541 atcgtggcat tcagctcggg catctggacc aatgtgttga cgtccttcat aatccccgtc 

54601 gatttcttca agctgccctg aatcttccga accgagaagg cctcattgac ctgcatctga 

54661 acactctgca gttgcgcgcg actggtatgt aaccgggtag actgcttgcg aatgcgaaca 

54721 agctctcgtg caaaggtctt ggcctccatg ttggcctgct tggcctggga gggattgcgt 

54781 tgggcccgct tcgacgaatt cataatgtat tgtcgtgtct tgctttccag agtcttcagt 

54841 tgtgcgatgt cacgatccaa ctgtcgggta ttgacgcgga tgagctggtt acattttctc 

54901 atctagagac agcaatcacg gtcagaagcg ccagacattg tggctggaca aaatcggcac 

54961 ttgcctgtgt ttgaggatcc gggccgaaaa agactgcttt gagggtctcc atggtgaatg 

55021 gcgcaagaga aatacggttt cacagttcga tatctatagc tgtaaaagtc tactttggct 

55081 tgaagcggga acggtcgaga gagatgttgt cagggtgctg gatcgctcaa atctggagtg 

55141 actttggtaa tacggtgggg tgtggattgg aggtggtcaa aggtctttcc cccacacttc 

55201 tgcgttgagg agcgtgatag ttgatcttct gacaaacaaa aaaatacagc aaataagacc 

55261 agatgccaga gaacgtgtcg aagaggcgca ccaaccagga tgtggactat tgcatggctg 

55321 ctggctgctg cggtgcctgg gctccaagat ggacggatgg agtggatgtc tggctgttcc 

55381 actcggcggc tgagtcagcc agatacggac gggggcagtg acgtcgctgg gcggttttga 

55441 gcctcaggca ctgagaggcc gcaaacagaa caaacgggcg gcgggagagt tccagtactt 

55501 tctagcattc atcactagtg actcgccggc gatcagcagc taagagcaac ccgcctgctg 

55561 ctgcaacttc actgtctagc cctcccgaca tcatcctttc cccattacct cccttatttc 

55621 gttcttgacg agattgctac ctccccgtct ctctactcaa attcatccta tccgctatct 

55681 accatcatca tgcccgaact cccccaggtc caccccgaag gtgagtttac tcccccctct 

55741 tgtccctcat ttccttcccc gctctgggtt ttcgtgctgg gaggtttcgg ctaccgggga 

55801 gttgtcatgc cataccatgg ccttattgtt cccaatatta tcgccctgcc tatatttcca 

55861 tgcacacatt catctctccc ggcatgagat actgacaagg gtatttcact agtcacatgg 

55921 gctcagcgct cttcggactc ggatgccgaa cgcaactacc tctacgttag catcaaggct 

55981 gccgatgttg cccgtcctgc cgcaactatc gacatcaagc ctaccagtgt cactttcaca 

56041 ggcgactcca agaagggcgt aaaataccac gtctccttgg atctgtttgc cgagatcgac 

56101 cccgagaaca gcaaggtcaa ccacactgac cgtgaggtcg agctggtgct gcgcaagaag 

56161 gagctcaagg cagagtactg gccccgtctg ctgaagacca cccagaagat tcatttcttg 

56221 aagaccgact tcgacaaggt acgataccgg ttctagtttg catattctta ggaaacagag 

56281 tgagctgact acgaggcagt gggttgatga ggatgagcaa gacgaggtga acgaggacga 

56341 ctacgcaaac aacttcggtg gtttcgaagg ccttggagga gagggtggtc tctctaacat 

56401 tgacttctcc aagcttggtg ccggtctcgg cggtgatgcc ggtgctgctg gagaggaaga 

56461 agaggtacgt tgtccaccaa aaccaaccga ttaatcaaga cattcagtca ctgacaccct 

56521 taccaggagg aggaaatgcc cgagcttgag gagggcgagg ctgagggtgc caagtcttcc 

56581 aagatccagg aggtttccta aatgctatgc ccttctactg agataggcaa cgactactac 

56641 gcaagcaagt aatcaaaaaa aaagcaaata atgccgagcg acgaatcctg taacctttga 

56701 gcttttcttt gctcattgtt tttctttcac ctatactata ccgctgatga tcacggttca 

56761 tggtttggag cgacctggcg cacttggaca ggtttctggc ctgcctcgag aggcgaaggg 

56821 aaacggaaaa ggaattctta actatttcct cttattggta ctgaaatgac gaaatgacta 

56881 gcaaggattt ctcttttcct caaatggccc attcctggac tgcgatccat gcgtagaaga 

56941 gatgctgaga gtccgaccga gaaacgaaga cttgcgaccg gacagtgtag taacaccacc 

57001 accatctgta cagagtagtt tgcagtgcgt gtctgtcgcg tgattaaaag cccagaaatt 

57061 agcgccaatt gaaccagact atcgagttca cgtctggcga acagaatatg cgcaaaggac 

57121 aagaacctgt ctggtgagga gtttggtcgg gacggcgaga ggagaggaac gaggagaatc 

57181 caactcgatt ggcggagagg ccgattatgt gaagttgttc cgccacactg gcaaacccac 

57241 accgtgaaaa gtgtggagga aaccttgcac ctcctcacgg attgacctct ctctcccttc 

57301 ctcctcattt agacaatctt taatatttcc ccatcttctt tcccctcccc atcagaaaga 

57361 cagccgaggt gtcgtcagag ttcttgacct actccgtcgg gacgtcaggt cgacccctcc 

57421 tttggtgatt tttcgttgcg gttttcttcc cttttgcggc gcatcttttc ttcttccctt 

57481 tctttccccc ctccctcgac gttggatttc tgtgggaggc tttccctcat cgtatcggcg 

57541 cgcaaaagtt atatatccgg catagaattt catcgctatc tttttaaact tcatttcttt 

57601 atacatatta gaggagtgca aatctacaaa aatccgccaa gatgcctccc aagaagcagc 

57661 agaatgagcc caagaagaag aaggccacgg tcgaggataa ggttcgaatg ccactctatc 



57721 aatcgatatt gtcgactaaa tgctaatcta cgattttgta cagacctttg gcatgaagaa 

57781 cgtacgtaac gagcccttct agacccgcat accactccct caaccatcca ttatcgactc 

57841 ccgagaaaag aaagtacaag cccactgacc ggtatttctc tgtacagaaa aagggtggct 

57901 cggccaagaa acagattgct cagctacagg cccaagcggc atccaacaag aacgccgatg 
57961 caaagaagaa ggaagccgag aaggcccgca gagaggccga gaagaaggcc gccgagcagg 
58021 ccaagaagga ggccttggag ctgttcaagc ccgtgcaggt tcagaaggtt cccttcggtg 
58081 tggatccgaa gactgtgctt tgcgttttct acaagcaggg taactgtgag aagggtcgca 
58141 agtgcaagtt cagtcacgac gcgagtgttg agcgcaaggc ggccaagaag gatctctaca 
58201 ccgattcccg agacgtgaag gcggccgagg aagaggcgaa gaagaaggat acgattgacg 
58261 actgggatga ggagaagctg cgcaaggtca tcctcagcaa gcacggtaac ccgcgcacga 
58321 ctacggataa ggtctgcaag tacttcatcg aggctgtcga gaaccagaag tacggttggt 
58381 tctggacttg tcctaatgga ggtgacaagt gcatgtacaa gcacagtttg ccccctgggt 
58441 atgttttgct cgttaagctc ttgacctatg tttactacag ttgctgatgg actacagatt 
58501 cgttctgaag acgaaggaac agaaggccgc cgagaaggct ctcatggaca agtcgccact 
58561 gaacaccttg accctcgaag actggctgga aagtgagcga cacaagctga ccggcaacct 
58621 gacacccgta acgcccgaga ccttcgccaa gtggaagaag gagcgtcttg acaagaaggc 
58681 tgccgaagag caggcccgca aggccaagga aaccactgga cggacactgt tcgaggaggg 
58741 caactggcgt gctgacgacg agagcagtga cgaggaagag gacgatggcg cgtggaacct 
58801 ggaagctttg cggcgcgaga cggagaggat ccgcgagagg aaggaagagg aacgcctggc 
58861 agcgctgcat ggcacaccag tgccggtttc gaacgacgag acgattgcac aggaaggagc 
58921 tggctgagtt ctatgcgata cttccccaag gacgcaaaag gcacttgggc gaaaggggtt 
58981 gcggacctcc cccacactgg gaacggtgtc ccaagccgtc gtccggacct gggatgccaa 
59041 tcgggttgat ggatggtaga tacaatgctg attctctttg ggttctttgc attgtgatga 
59101 agatatacca aactagatcg ccttgtatcg ttgtccgcga ctgggttgga tggctcttgc 
59161 ggttaaacgt gaacggttgg tcgtgtcgcc tggattcctg gttcaaacac cgttggatcg 
59221 gtgtaagctg aacagaatcc aaataggcat ggattggggc ggagtgaaat atcaatggga 
59281 gagaagatac cagattgatt gattgagaga catcgacagg gtctggaggt tcgttatcca 
59341 ggggatgaac tgattcatga tatgatttgc tatgcccgat ctgtacgtag aacgtttgtt 
59401 tttttggttg aaatgagcca gcattgccct gaacgtgcaa atgaaaagtg caattgatcc 
59461 aaactatctt gtaacacatt gacgccgaat tggacggtct ggcttgtacc ttgctgctcc 
59521 tgctgctgga aaccctcagc ttacattcca cgtcagggga gggttggaag ctgtctgtct 
59581 cacccatgaa tgttgtcact cgtcggctgc tgctgcacga tcttcccaac tgatgctcac 
59641 cgatctgtcc atgcaatctc caccatccct ttcttcccga tcgaactatc caggaacgcc 
59701 cgcccatcct ctccctccgc ggatctcact attccctcct cattcccgaa gtctcttatc 
59761 cctctccacc atggcgcacc ccagtcccct cgacaaactg aatacataag gttaagacga 
59821 tggcagctca acagtaccgc cacagtcccc tcgttcccag tggtcccatc cccacgccgc 
59881 catgacccag tccttcgtct ccttcaccgc cgtcctgttg acgggattgc tgtcgcattc 
59941 ccagcctgcc gcggcggtgg gaatcagctg cgatccaaac acccgtccat actcagcctg 
60001 gatgacagat agcgtcattg cccgacgaga tgccattcta ggtgactttg cagcaggaga 
60061 tgatatctac ccgaatatct ctcggatcgg gctgtttgag tcaaccgtca tccggttgaa 
60121 ggattactac gatgccgatt cttgcggcca ggtggattgg cacagttaca cgctgcaggg 
60181 cgcggaaagc cttcttccgg cagtgttgaa tgtttcgttt gatgtgaatg ctccgctcac 
60241 tgtgtttgcg gcgggagatc ctatgtatca ccagtaagtc attcaatctc accacttaac 

60301 tggctggcct cagggacaat caaaccgaca tctatattgt tagatacgat cgcgcgccaa 
60361 atgagaccta caaacaagcc cttgataccc ttcacgatgc ctttaacgca agaacccgcg 
60421 actcagaagg tcggacaaca cccgcgtcaa taccaacaac tcaatcccat actcaaaatc 
60481 cgaagcatac taattatatg aatatccagg cgccttctgg tacttcaacc cctaccccaa 
60541 ctggggcgtc cttgagggcc tctaccctct cggctccttc atctccatgt ggaaaaccta 
60601 cttcgagccc actaacacca ccctatccga ctccctaatg ctcagcctcg atctattcaa 
60661 agaccactgc cacgacgact ccagcggcct cctcttccac ggctacgact ccagcaagac 
60721 cgcctcctgg gccgacccca ccaccggcgc gagttcctac gtctggaacc gcgctctcgg 
60781 aaactggttc atgacgctgg tggatctact tgaacgatca ggcaaaaaca gcgcgattct 
60841 caacgagacg cagcgcgacg acgtctacac gaaatactgg aatgtcgcta acacgatcat 
60901 caacgacgct gatgaggaaa ctggatgctg gtggcaggtt atgttgcatg gtggggaaga 
60961 gggcaactat atcgagtcta gtggatcagc tcagtttgtg tatggattgt tgaaaggtgc 
61021 cagattgggc tatttgcagg ggaagacacc caatggagtg ggatatactg atgcggccga 
61081 taagtgctat aactatctgg tgagcgagtt tgtcaaggag gaggcggatg gctcgttggg 
61141 gtacaatggg actgttgggg attgtggatt gtatgatccg acctttgagg tatgaatttc 
61201 ttcttcttcc attcttttgt gattcggctg aatgctaacg caattgtatt ctcaaagtac 
61261 tacaccacca gacccatcga atacaacaac ctcttcggca cttcgtcttt catcctggca 
61321 agctgggaac atgagatgtt tgagacgggg tattatgatt agctagctag ctgaggtgga 
61381 tttacttact actaatacta ctaataccat tactactcgt tcttgcgata tctggtaaat 



61441 cctatgacag ctcttataat atacaatacc atctaacagc attactgagg ccgatgataa 

61501 aatagaacga cttcacagat aaatacatat aatcaacaaa tcaatgcaat tatattaatc 

61561 ttaccccaga ataaatgcta catctagata gtagatggga cgtcggagac aaaaaaaaaa 

61621 aggtgtggag aggggcggga agggggaggg gaggagtcat tgagtttgaa accatgtctg 

61681 agcgtaggtt acaacttgag acatgaagat atagtatctg caaggggggg aagaggggta 

61741 gtgctattgg gaagtgggaa gggggataaa actgaagttt agtgctggtc ctggagactt 

61801 aggcaagtta gtttttaagc agggtcttga gacaacttaa gttcgtgact attgatttag 

61861 gtcttgcatc cttggacgac cttgggtgac agttgcagtg ctcagctgac cctcttgaag 

61921 aaggtggatt agggtcttga ggtgacctaa atcttctttt acaacttgca tagcagagta 

61981 cggaaagtca atttgcctat gggatgttca aggaaacaat aaaaaaattc cggtcaagac 

62041 attgcaatta cgcagacagg cagagagtca gtcaaaatct ggcttagcac taaagtgata 

62101 tgttaagatt gtgcggacca agtaagagat atcccatgtt ctcatctcat tgcagtcatc 

62161 actttcatca ttgtcgatca ttctgttgaa attggccgtt cacataaata tctatttata 

62221 tatgtacggc gcaccgataa taaccatcct tttcagttct tgagaggctc ttggggcgag 

62281 cgcagaacat cctcgaacca cttgtacgac gggaccagac tgtgagggta tcccatgtgg 

62341 ggtagtccat ggaagaatct aaggggcgat catggttaag gactaggcaa atgcggaatt 

62401 ggagggagct gactgacctt ccctctttgg ggcttccgtg attgaagagc atcaagcagt 

62461 cttcgatggg ggtgacgcca tcatccacac cctgcatagc atgggtcagt atcatacaga 

62521 agatcgcggc tgtatcggtc ttacgttgag cagcaacagt ctgcagctag gctgatccaa 

62581 gatgcccgtc tctagaagag agaacttctt ctggccatgc ttcttgaaat cctccgggtt 

62641 ggagtagccg tacttcttgg cccaggcagg ggtgagccta caaaatgtcc cattagtgtc 

62701 gttagcaatg acacacgtgg tcacagtact tacaggaatg ggtactcgtg gtcatcaaca 

62761 cggctcaacc actcggggtc caggaagtaa tggcagcccg gtccgtgggc gagagttccc 

62821 acgaagcgat cacggtgggt atgtgcagcc cggatggcgt agaacccacc agcacttaga 

62881 ccccaggcag caacccggga catgtcgaac tccggacgag tcgccatgta gtcgagaacg 

62941 gtagtccact gacggtccgc cgattcgggg tcgctgggat cggcaggcga gtccgcggta 

63001 ccggggatct cacagataac tgttgcccaa ccccggttga tgatctcatg cgtacgctga 

63061 ctgttgtcgg gacggtatcc atccagacca gtcattagaa ctacaactgg gacggggttc 

63121 tccgccgtcg catgttccgg gacacgaaca aaaaggggaa cataggctcc gtcggtggcc 

63181 gacttatgct tgtgcgggat gatgacctcc tggatggtgg gcttccagaa ggatgctgcc 

63241 ttcagataga ccttcttctg gcggttgaag gcctcttttt tgatgtcttc tctggtcggg 

63301 tcgacatatg ggaagcggga gatacggtag accacagcag ctcttcggag caagtcagat 

63361 gccatctcca cattgttgtt gttcaaagca gctgacgcct tattttcgag tgcctcagca 

63421 accgggaaga aagcctgcgt gtaggcatcg gtgtaggcat cgttgatatt ttcctatcta 

63481 gaagtgagta ttttaccctt gcgctcgtaa ggcgcacatg gcaagattat gataacctac 

63541 atcgataagc ttttggaaga taggtcgaaa gtcttcgatg gcaccgtcgt gaaacggata 

63601 gactgatttt tcacactgca tgctacgtta gataagacct ggctgagcgg ttggccaacc 

63661 actgagcaag gggtggtgat gccgaccacc agtgaaaact tactgcaaac ttccacttcg 

63721 tttcccagag agctttgata gagcccttgt gggggtaaac ggtgttgaac ttctcgccga 

63781 ggatccaagg agccattgtg tttgcttgct tgttcgcgcc gtccagtctg tgagtagaca 

63841 atgtgaagat ttacagttca ggcgggagac agcattgaag gacccttttt aaacgtagcc 

63901 cattcaatgt aaggtactca cggcgaaact tgccgactgg gttgcacatg aactctcctc 

63961 caaaccttgt gacatcacaa ccaccattga gagtacttag ggctacaaac gcaggggggc 

64021 tgaccaatcc acggccgctg agggaacaag gggcgtcact gtctgtctcg ggtctcggac 

64081 aatcggtaag gcttgacaca ctgttccgct gacagaggtt ggggtgagag tttgctttgg 

64141 gtggcagtag ttgaaactcc cttgcctgcg ttgtgccatg ttgacgactt tccgctatac 

64201 cagatcatgc tttaagtccc aggcatccat gttcgagccc aaccaagagc ttctacttgg 

64261 aaggaaagag accgctaatc cgggagccta aacttcttgg agtatgtgtt gctgcacaat 

64321 taaagtagat cgaagcggtc gattgccgag gatagacgtg attggttggc gtacatatta 

64381 tactcattgc agcttaacta gagtcagaaa tacttcatgt aagtgctcga tgttgagaac 

64441 atcgtggtag ccacctggtg gacccaaata tgtactgcct acgggtgaac agggcatgtt 

64501 cacaacggta gttctttcat cgagtgaatg ccttgcatcc ccccgattcc aaggcggaag 

64561 gcgcattcca atgcctgctg tcgcggtctg gatgaggaag gattgttctc tgcttcaagg 

64621 tctcggtcgc ggagcagtgg tatggtcagt ttcgaccaag caaatatgct ttgacaaaag 

64681 aggccaagta ttccgaggcc ggagacttcc agaggcctat tggcacaaga agaggaacac 

64741 aaagcatgga gacttactct actccgtaga gaaggcatca agggctagcg tgcagacaac 

64801 gacattgaag cgagttactg ttgcaacagt agtgctcgga catctatcgc acagtgcctt 

64861 gcgatttggc ccccttcata gctgaagtaa ccgccattag gcgtcaatgt atccagctat 

64921 ccagaaccag acaagtgtgc caatacctca ggaattttct tgccggggca tcctgcatct 

64981 ctgaactggg aaggtatctg catgtaggag aagctggccg actccctact tctccttccc 

65041 catcccttga agcaaggaca ggttacatgc ttgcattcta tgtctatgct ggaatgattt 

65101 cccccgggcg gcgactgaaa gcacgactcg catcggatcg gctcatttgt aggcttgcgc 



65161 tccggccatg cgagaaatgt ttgtttggat ctgatgccga ggcggcgaca ccgacgaggc 

65221 atccaagtcg ccaagggttc ctttccttgt tgctccgcgg gctgaactaa ctttttgttc 

65281 ataatttagc agcccgagca gttgtctgta catacttatt ttcgacttcc cttcaggaat 

65341 tgctccgaaa gtctgcgccc gcgtccggtg aatgctatac ttgaagccgc ggcaatgatc 

65401 cgaaccctca tcacgctcgg cctgtacggg ttcgtccaac aggtatcgag caaagatgtg 

65461 tacctggatt ggaatataac ttgggtgact gcagcaccgg atgggtatgc gaggcctgtg 

65521 atcggcatca atggacactg gccttgtcct cagatcgacg tcgatgttgg cgataacctc 

65581 atcgtcgacg tgtacaacgg gctgggtaac gagtccaccg gaatccattg gcatggtctc 

65641 catcagaact acaatgggta catggacggt gtcccaggag tcactcaatg cgagctccac 

65701 cccaaccagc ggatgcgcta cattgttccg gtaagctgac cctgcttgct ggatcagtac 

65761 ctgaactgcg ccttctgacc catggatcta gatcaaccaa accggcacat attggtacca 

65821 ctcccacgag cccggtcaat atcctgatgg gttgcgcgga cccatcatcg tgcacgataa 

65881 tgttcctgcg ccgtatcact atgacgagga aatgactctc actctatcag actggtatca 

65941 tgtacaaata cctgtcttgc tcgatgaatt cattgttcca tcggaccatg ccccctatgg 

66001 cggtcaagag ccactgcctg atgcggtttt gttcaatgac atgcataaca ccaagatcag 

66061 cgtcaagccc ggcacgacct acctgatcca cgttgtttgc atgggaaact tccccggcca 

66121 tgcactcgtc attgacgacc acgacatgac cattgttggt atggatggca ttgcagtgga 

66181 accgcatttt cttccacccc agtatctgcg tgtggccgtt gggcaacgag tcgacatcct 

66241 cctcaccacc aaaaacgata ctagcaagaa ctatgcgatc tgggacggcc tcgatattga 

66301 cgaaatgttt gttcaagagg gaaggagccc acctcaaggg tacaatccca acggcaccgc 

66361 atggctcatg tacaatgaag atgctccttt gcctcctcca ccagtgattc atgtcaacaa 

66421 tgcaagtctg gacttcctcg acgatgtgac cttgacaccg agagaccatc ttccgttgat 

66481 cgacccaata caacaccaat tcatcttcga tgtctaccca accacagtgg atggaaagcc 

66541 cgcctacagc atcaacaata tgacatataa cgctcccgac tctcccacgc tgtatactgc 

66601 catttattct gaccctgagg cttgcatgga cccggacctt tatggggatg tgaacccttt 

66661 tgttgttaat tatggtgatg tggtggagat tgttctcaac aaccaccatg acaacctcca 

66721 cccgtggcat ctccacggtc atgatttcca ggtgctgcag cgtacctacc cgtacggtgg 

66781 cttctttgat ggctaccgta acgtttcggc gactcccatg cgcagagata cgctcatggt 

66841 ggagcctctg ggacatttcg tgattagatt ccgtgcaatg aatcctgata agtattccct 

66901 cgctgcagat tagatgtctc caaactgacg ttgatcggtc aattacgcat ctggatgttc 

66961 cattgccata ttgagtggca cgttgagatc cgcctcatgg cgatcattat cgaagcaccg 

67021 gaccagctgc agaacatgac tataccggat gatcacctca agatttgcgg aaatacccaa 

67081 cccagcaatt gggccccatt ggctccttcg ccatagtcag cccgacccgg catcaggtat 

67141 gagtgccgtt gatatgttct aaaacttact gacttagtcc agaaagagcc aattggccac 

67201 gcacacaaca tctctatctg ttgacaatgc aggttttgaa gccactatac tttgtcaatg 

67261 tgagtgttac aaggatctga gacacataca cgagacccgc tggcgatcgg gagactcgac 

67321 ctgtgtgaat gggcccttcc gatgtttgga tatgctttaa tgtccgtggc gaccatattc 

67381 cgtgataagt aatgttacga acaatagacg aaaagttgta tcatctttct tctaccgaag 

67441 ttgcagctgc catgcagttt tcactcatag atttgtatgc cgcatgtact tcctgtagcg 

67501 caatagccga gcgactttct gtagtcaggc ggatctctca ggtgggcaac tctatagcga 

67561 ggcttttgcc tttattaatc cgttttgtcg catcaagtcg taaacatcct ctgccacacc 

67621 ccaggacaat tcaaaaccac gacccccagc tccgtatgca tgaactatca cttcctgacc 

67681 ttgatttatc ttctccactt caatcctcat gcccccctct cgtgctggtc gacggccaac 

67741 aatatcacga atcacatcga actcttcttt acttccctcc tcgaagggaa accacttggc 

67801 agcattcgcc aatagctggg cccgtgtttc cattgacgga ttaggatccc aattatgcgg 

67861 ctgcttcgtg cccccaataa ttgtcccacc acccaatggg cgtggaatac agaaggacca 

67921 cgagccatcg ctattctgtc gagtgatagt agccgagcaa ggattcctca ccagacacgt 

67981 ttgccctgaa tagtgagtca gcaacatcaa gatcctccaa gagaaaacaa ccactcaccc 

68041 cgaataatat aactcttttc atccccaaac cccaacccag aacaattaac cacagccctc 

68101 accctctccg cgagataaaa cccctccctc aaatctacga acgtgtactc tttcacctcc 

68161 ccacccctca acacaaactc cctcaacaac cacgcacaat acacgggaga attcaccaca 

68221 aacgtcttat acttcacagc ccatttcaca tcccccggca tctgcccggg ctgtaagtac 

68281 tcaaacccat ccaaatgccc atacgcagcc tggatgttct tctcgttgag gtactccgcc 

68341 ggcgggttct cgaggtattc aatgccagta gtggactgaa cacctgcgcc cggctcctga 

68401 gccgcgagtg tcttgaagtg tgcataggtg cggcgggctt gtgtttcttc tcgggtgtgt 

68461 tgggcgttgg agccagggac gggacggtag tgggcgcctg cccagggtga ggcgtagttt 

68521 agggatgtgc tgtgaggaaa atctcgggct gcaatgagga cgcgttggga tggggagagg 

68581 cgttgttgga tgtagagggc tgttgtgagg ccgattatgc cggctctgtt tggttggaat 

68641 gggtgggggt ggttagtggt gctgttgttg tggttgaggg ggtggcattg ggatacgacg 

68701 taccctatta cgacaatggt ttctttttcg gccattatgc tgtgattgtg attgtctttg 

68761 cagactggtt ggattgtgtg tatgtgtgta tgcaagtggt gtgatgatag tgttggcgga 

68821 gattgagggg atgataagcg tttctggaac cggttccccg cacgcggagg aattgagtca 



68881 cactttgcgc cggccggtgt gcattgcgtc tttcatagct atcaatcgtt acgcttttgg 

68941 tcaatggctt agtactactg caaatggcta cgcaacaggt gattagtgat gtggccatgg 

69001 cctgattgcg tgcagaatca cccggacaac tatctattgt tcggggatag atacgaacta 

69061 tagctagtat tccgaagtga gcgtggagat taaacagaat acgattaagc tgctattcgg 
69121 ttctagctgc aattagcaag gctagaacag ccggcactga ccatttatct atctgctatc 
69181 taagctctcc gagtatcggc ttccgcctgt ctcgcttcgt cacgtccacc ctgtgcataa 
69241 catgatggcg gtcgttagaa ctatctggtc atcaagttgg accagcccat gatgcacgtc 
69301 gcaagccaat atgcagctcg acctggcctg ccaggtagcg ttgccatcgc atggacttgc 
69361 acaacatcag acaggatgat cgtgctcgga cgcagtggga ccagacgagc tcctcgtttg 
69421 taagatgaag acgaactgaa tccgcatccg ctagtaagcg gtgcttgggg tatatcttcg 
69481 tgaaacgtcg tactgtggcg tggtatcaaa cacacatctt ctttcttttt cttttgttgc 
69541 cttgcttttt ttaccaggct taatccaaag catcgggcaa cttctcgatg gtgaatttgc 
69601 ccctagcaac cttcatgatc ggaacaccag gaatcttgcg caggcggcga cacagatctt 
69661 gatcattggt cgccacaatg taaattctat gcttggtcac ctacaagagt cgttagcgcg 
69721 ctgtgctacc caagaaaaat ttgagtaacc actcacccga tccacgatac agtcatctgc 
69781 ataggtaccc ttgtgagagc acttaaccct agaccaccga gggtctttgg caacccgaag 
69841 tgcgaggcgg tacttgtcac ccagcttttc caattccgcg atagtgcagt ctgagaaggt 
69901 cggaatacac ttcgcgtaca gcctgcagcc agtcttagca tatatccaaa gtccaaatag 
69961 tgttccaact cacaagtcca tcatcgaagt aagtaggtcc tgcttggctc gaatggcgtg 
70021 ggaaacaaag ttggtatcga ccagcacgtg gtagggaggg ccaagggcag tgttcgcggc 
70081 aaagaacatg ttcgaaggag cttgggtgac ttgtcgagtc agctgatcct ccttgggcgt 
70141 ctcctttttc gcttccggct tcttcagtcg ctcatcacgg gctttgatgg cgcgcttcat 
70201 ctatagattc agtaggatca gtttccgttc cgtatctgct gcgaagccac agagacatac 
70261 ctgcgcgaac ttgcgagtct tcttttgcac acccattttg tgataataga ccttcgccgc 
70321 taaacagaaa gaaaaagtac accagcgggg aacgggagcg gtattcgttg ctatcgatat 
70381 cgttgagccc agaagatttt ggagaaggcg gaaaatcaga tcacgggccg atgttttgcc 
70441 gcgatagcca gagtcaagta gttccgctat catgtgatct acccgtgacc gcttgttgcg 
70501 ttgcacggca tccccgtccg gctaactgga aatagtagtt ccctgtcgcg aagtctagag 
70561 cttcttgtta ctctcgttgg catcatcgga aggaacgtaa aggcccaggt gtatgtacag 
70621 ttgcactctg gctctagctg aagagtattc gtctaaatat tagatgacca taatgccatt 
70681 gtcccgagcg tgacacagac tgaatcctac cgtgcacgag gcatcttggt ggatgatgaa 
70741 tttcagtcag tctccgttcg ccattatggg ccgaatgcct gcctctcccg atgcctttct 
70801 tatcacatcc aaggaagagg caaaatctca tcaccgtccg agctcctcaa ccacccccgc 

70861 aatgtctaag ccccgggacc tcccactagt cccgaaggac gccagtcaca gcgcagcttc 
70921 tagttgatct catcacattg catgcacagg ccgtgcccat ctgtggaagg gctattttcg 
70981 actccatact ccgtatacta gtccagccat gtatgtatgt cttagctgga tgtgcgtaga 
71041 attgttgtct caccagctcc gactgtgacc gaccgttcag atcgttagct cctgtcagtg 
71101 tcaaatgaat cgccgatgaa ggtacaccat attgcggcca tctgaagtat tcggccttcc 
71161 caagggaaaa taacaaggtc gtgtagacgg tgatggtttt cgcattcata atcctagcgt 
71221 attttcagtc tggaaattgt tgtggtagag tagtaccatc ggttactaca ccgccatagt 
71281 atatactaac catgtagcgc cactatccat cttgagaagg gtgagcgcaa ttcagatggt 
71341 atcggtaata ggcggcctta cagcacttgc aatgatggct tttttttcgg tttatcttca 
71401 aggtttaagg atagcacctg cgtgcgccat cttatttcca cattttcaag gcaagattgt 
71461 agtatgtctc ctgtttattt cggttcttct acgtactatt tcagtgttcc aatccacgta 
71521 agtacccaag tatatcacgt acctcagtag ttccgtcccc agcacgtcga gggatgtgat 
71581 tcaaagtccc acagtacatc acccaagatt agccacaatc ctgctgacca cacgggttga 
71641 gcagtcgagg ggagggttcg gcgataccag tcagagcgct catttacgcg ccgagcaggt 
71701 cacttgaggg cgactccagg gcgggtgccg gcgctgattg gcacaaccga gtgatgaggc 
71761 ctgcagcgtg aagaaacgcc ctctgcagtc cgcaagccaa tcaggcctga ggagccggcg 
71821 ggaaaggtcc ctgacagcag ccgctggcca tgcagaactc cccttaggta aggtattgta 
71881 tacctctgcc tcccatacca aatcttagta ctaactaagt cactggtact actccagccc 
71941 cgcctgggaa aaacggacgc atgcaatagg attgaactaa ccaccaccca caactccgta 
72001 ctaactacta actcacctca ctagtatggc ccccgtccga ttctctctct cactccctca 
72061 cccccccatt tcccgcccat ctttgaccgg tgcaacaatt ggcgctcatc tcatctcctt 
72121 catttttagt ctcaaactct ttcctatcca ccgattgcgc ttcatgttga ttgtgggcgg 
72181 catcgttcga tgattagccc gcttgatctt cctatagctc ctggttagcc cgccaacggc 
72241 gtaacacgaa gatgactgaa ctcctctcgc agccccccct ggtcagacgc ggcaggatcg 
72301 actggcagtc tgctttgcac tccggtttcc acgtcccgct gctgcatgct tgttggtgtg 
72361 ccgctgatgg agccaccgac agctcctcat gtcgctcgac gggtcttatt tgctctcgtc 
72421 cgctaattaa gtcgttgaac gccctgctcg tcccatatcg atgatatcct ggtccatgga 
72481 agcgccgtcc actccccttg gcgctagctc gtgttgcttg atggctggag atttcgatcg 
72541 actctctccg ctgccttcgg agacgtctca gatttcacag gaaactctcc ttgatgacca 



72601 cacaacggac tctcaaggct ggcgccttca ggagaataat aaggaataag agtccacagg 

72661 gccagagcac gactatcagt ccatgctctg cgtgaatcct gtgttttgtg ttagtgcaat 

72721 cgttcctcat tttacttttt gctcccctcc ccgcctctgc cctggtcttc cttattatct 

72781 ttcccccctc cacaattcgt gccagcaacg tggtagcctg cctagtgcgt tgtatccctg 

72841 gtcgttccac tgcatcattc tttgtctttc atttgcattt ttgtgcgtct gcctttttgt 

72901 tcggttcgga tttagcgcgc tatcaccatc gtctgtcgct gtaccctgct ataacctcgc 

72961 ttcttcctcg ccacccccaa accccaagcc tgttgagacg aaaggccgtc attcggatca 

73021 ttcctcttac tcgtatatca gcactctaac tgttgtcttc tcaattctct agtcaaatgt 

73081 ctgttgagat cacgttgaaa attgtgttct cgagtcggta gtcatctgaa gcagtgacgt 

73141 ttcgttggct gtcctgagcg ggactgcact catttggtgc tcgtatcgtt cttgtcttgt 

73201 cctacgtgcc aatgggaggt ttataattac cattgctcgt tttggacgga tagcgtctct 

73261 gcccttgtca cgccgtcatc gttgattatt taactgttac agaagtgatc tgtgcccgga 

73321 ccgagcactt gacacttgcc tgtctggaga cggagcttgg gattgggaca gggaacagga 

73381 aataaacact tccagcaccc cgtccaggga cctctctgcg gcgcagtcga aacgatggtt 

73441 gaactatcat ggaacagaaa tccagttcaa gtcccagatc ctcgaccttc gagaatgaag 

73501 cccgccgcac ccccggtatc aacacagcca cagacacctc ctgcccggct tgagccatgt 

73561 gcgtctactg cgtctttact tctctatgcc cagggctcgg tgattgtatg tctccaccat 

73621 gatactttgg cgctcgaacg acgcttcgaa aatcaccaag acgatatcgg attcatatat 

73681 gtggataatg ttagcgagcg tggtgcggga aggctggtgg tcagttacga tgtcagccag 

73741 actgcaattg tatgggacct gttcacagga agcgtactcg ctaggttcgc ctcgttcgag 

73801 cagctcaagg ttgccgcctg gatgcggaac ggcaatgtgg cctttggtaa cacccctccc 

73861 acccacaaag ttctcttata aacaagcgct cacaaccatc cgcaggtaat gaaaaaggag 

73921 acgtgatcat atttgagcct tctacctcgg agcatatttc gtgtcgcacc atctttgatc 

73981 ccattaccgc tttagccccc gctacggatt gtcgaaccta tgccattggg taggtaaact 

74041 gcgcctagtc ctcacaagag tcgaatggtc taacagttcg gcttagttat caaaatggtt 

74101 ccattatgct cgcgacactg cacccaacat tcagcatctt acataccatg tccacttcac 

74161 gaggaccatc ggcgataatc tcgttagcgt ggcatgcatc ctcgtctaaa cagaagtcag 

74221 atatgctagc cactttggca gccaatggtg atttgagagt ttggagtgta gcaaaaccac 

74281 cgcagaaaga ggccccacgc gtgatcagag tgctcaagag gtctgatggc tccccagcca 

74341 ctgaaccaaa gtggatggct tggtctaaga atggcaggat cgtgcagttt cttgaagggt 

74401 aagcaggagt cttcatcttc ccccgggtta tgaggctaaa acccgtcaac agcgaaacgt 

74461 gggcttggga tgtcaggaca aaacatgtca cgtatgagcc catccccacc atagacaatc 

74521 ctcgtggctt cgccaactat gggccgaccg ccacactatt caccctcggc cctcaataca 

74581 ctgttcagca gtacgatata gacagccctg ccatggtcgc gaatgtgcag catgccccct 

74641 ctggagcttt gccggcgcct atggagcaac cgagactacg caccttgcag gaaccgccaa 

74701 caatgaggga gtctggtgcg atgtttggct ctcgaaggac cccatttgac acgaacggaa 

74761 ttgaggctgt tcggcaacag cgtgcagatg tcgcaagccc cgtgaatagg agccatgcca 

74821 attcggtcac gaattccgtt agctcgaagg cttcctcggg catgcacaag ataccattct 

74881 caccacccag taggtccggt cagacgacca catcattctc gttaacctca gcatccggga 

74941 gagaaacgcc ccaaccatcc ggagcttcct atgcctacgc atcgtccgtg tcaatgtcgt 

75001 caaccaaaag ctctcgcgcc ggttcacggc tgaggaacga ggtacagctt agtcctgccg 

75061 aaaagccagt ggatctattc ccatttacgc gcgcacgcct caatgatgta ccttatggac 

75121 accatcagcc ccttgacgag tctcgcctca cgccagacga tctgcgtcaa cagatgctga 

75181 gcgtcgtatt tggcttcgat ggagatattg aagaattgat tagggatgaa cgtatgttag 

75241 aacctgtgct taccagatat gtttctaaac attcgctaat tctgtgacca gtgagcaacc 

75301 atcccgaagc tagccaaagc gccgttctac ttgctcagtg gattggagaa tccgatacag 

75361 acaatatgct ttccatgatc agttcggggc cagcctcaat cacagactgg atgctccttg 

75421 catttagcca gatgtctgga gaatcgcagt cgaacaagat cggacaagca ttggtccaga 

75481 agctgctaga aattggagat gtccacacgg cagccaccat actcctagga ctgggagaca 

75541 gaaacgatgc catcgaagtc tatgtctctc agaactatta catggaagcc atactgatga 

75601 catgccttgt aatgcctaca gactggcagc gacaatcgta tttggtccgt cggtggggag 

75661 agcatgttgt gtcacactct cagcaacagc tcgcaattag atgtttcatg tgcacaggtg 

75721 tagagccctc cgagccatgg acttctcctg ctgcacaaca ggctacatcc ttcgcggaag 

75781 ctatcaccag aaggtctcca atgacatctc cggaacctta tccgaaccca gcaagtctta 

75841 tggtgccaaa ctctcaccca agtccagctg tcaatggcag accggtatcg aagacaccag 

75901 cgctgaagct tattacttca tttgactctc aaccaaacaa tcgattcaag ttccctggac 

75961 taaagtcaga tgacagaacg ccaacaaatg cccccggggt tactccgatc gcagaatctg 

76021 cagtcgtcga atctgccttg tctccgggtg ggtttggatc gtacaagctg aataatattc 

76081 agagcataaa taacgcgatg aatacgcgca caacaacgcc tgccttcaac agacggcgtc 

76141 ttccatctat cggggagacc ccaattgacg tacaccctcc tacattcccc tccacgggat 

76201 acaataaaca agcgctagac tatggatcaa cttcagagaa tgacgagagc cagccgctag 

76261 agcagaaagg gggggaagca gatgacgaac agcttggttt gctatcatct gcccgttatg 



76321 atccagaaga gtacaagccc agtcctcaga ccgcagtcca agcgacagca gatacattct 

76381 ctaccatcaa gggtcttcct tctccggctg ctggagtttt tgaagccttg aaggagcgtt 

76441 cagacagccg aaacggaact agagaccgta aaccagatgg cttgcagcta aatcttcatc 

76501 accgcaacgc atcggaaaat gacatgcacg ctcaatccga gatgctggcc aacttccgca 

76561 gtccagcgtc aacgttgaat agtttcagtt ccgcaaggag ccccagcgtt agcggccgca 

76621 gcattgaccc gtatatcagt agccttggcg aggcgaacta tcacgcgaag aagtatcaag 

76681 gaaatcatca taacggtaga ggaagacgga atacggacga ctcggccagt cagacctcgg 

76741 cgcggaggta ccgttccagg aacgcttcac aggagacgcg tgggcgaaat gaaaagagat 

76801 acattccacc cgccaaacgt tctccctcat ctcctgtccc aatgtcgcca gaggaactgg 

76861 ctcgtatcaa cgtcaatgcc gccaacaagc catcgtctcg aacacgcagc gctagcaggg 

76921 tccgcaagcc acgttcgcgc aactcctctg aacgccgtca aaaccgttct gccagtcgga 

76981 atacaaccaa ccgtcttggt acggaaaatc aagatccctc cagtcgagga cgcagcatgg 

77041 accgtcaaga atcaggcacc cggtctccct cgtctccatt gccaggatcc agtgcagaag 

77101 acgcattgag actagttgcg agtgatagag aacggttgcg aacccgccaa cggagcggca 

77161 gtcgtcgacc tcaaaagggt gcaggtggga ggggcgagac gccatctgag tcgaagtggg 

77221 cgagagcaag gtctcgtagc cgtcacgata ccgagccgac agagagcgcg caacaaggtg 

77281 tcctagaatt tgtcaatggc agtaccgctc atgatgacat ctacatcgtc gataacagtc 

77341 ctagtgccaa taacatgctg gcacaggcat ctcacgcgga gcacactgct gtcctccagg 

77401 ataaacttgt ggaacagcgg aggaaggaac ttgccgctgc agagcttgaa gctcggcgcc 

77461 tgtcgctggc acgcaatcca tcggcgccta acatcccatt tccaggggag ttgcagccga 

77521 gtcgttctgc tttggagaat gcaacatttt ccatcaatgc ttttgcgccg cgtacccccg 

77581 gccggaaccg gactgctaca aacaagggct ctccagagta tccaagcagc tctgattcaa 

77641 actcttcgag gtctggtgtg cctttgggac tgcccgcaac cccgaaagcc atgaagcacc 

77701 ccaaatataa cggctatgaa gaaacagcac cgacgccccc ggtaccttct gatgccggta 

77761 tcctacttag tgatgccaag taccagggtg atgctgagcg gatcggtcgt tcgatgtccg 

77821 ttccagtccc cgaaagccac cagtttggcg ctgtaccaac ggatctccca atgcacccaa 

77881 gatacaaccc tcatctcccg agaagtcggt ccacgagccg aacgcgaagt atggctcccc 

77941 gtcgagacag tggtggacat agctacggcg gctcccctgt gacagttagc atagaggaaa 

78001 ccattgaaaa tgcaatggca aaccagcaga ccgagaatcc tcctattttg ccggagctcc 

78061 aacatttgag tcctcctcca ccgccacctc ccccaactgc acctttggcc tctccacgcg 

78121 agtcttcggg gacgatagac atcgccatcg acaatgaaga cttcggtcga ctgctgcctc 

78181 gcgcgatgac cacaggacct acgcttcaca tggatggccg tccgggtttg gaacaacgcc 

78241 gtatgtcttt tgagcatcgt cggggccgca gtgtcaacga gagcctgagt agcaagattc 

78301 gcaacttggc ccgaatgggc agcaccaaga gcacagactc gtgggttgct actgatattt 

78361 cgatcccgta tgagagcgtt atcgtcgacg ggcgcatata agcatacctt tgaggcattg 

78421 catctaatgt gtcgacactt gcggttggac attcatgatt gatgacattt ttttcttttt 

78481 cattcttcca attctttcta catttgctta tttttctcca tcttcccctt aagccaaggc 

78541 atatctccaa tattggtcat tgtgatagtt gaactgtata ccctggcagt actcttccct 

78601 gataatgatg aacaagcaag aacttactca gctaccagaa gatgcacttt aatgggcttt 

78661 cttttctatt cttacccttc ctcgtctcta aaagtggccc tcgcgggcac gagaaaagtc 

78721 gctccttcac ctatttacct aagcgttgcc acactatgta ggcatttagt gattaatcaa 

78781 gctattccaa actgaagcgg ttaactgcac gggagcatta ggccttacga aagcattact 

78841 tgtaggagac atgcagctct atccgactgg gaccggacaa gtataggtca aaagcaagga 

78901 atcaaccggc agaggggaaa gtcgccattg tagtttaccg gcccatgccg tggacaataa 

78961 aagttagcgc tatgaataca ggccgctata caaacaatca gtcccatttc gcccaccacc 

79021 cagaacggtt ctgctgtgga gtgttcagac tgtccgttga catcttccgc tccgcctcca 

79081 acattctgtc aaactccgcc tgcgccagct cacgctcact cggtttgtcg taggccggag 

79141 ccggagcaga ggcagttgaa ctcggggcag gcctagacac agtactccgg gtctcccgct 

79201 gagccgagac attattttgc ctccggatgc gctcccaggc actttcacca gacatcgact 

79261 ggtccttgcc ccggtactcg gcagccacag gactagcatc atcccagaag tcaccattcc 

79321 catccttcga cggactcggt gcctcatctg ccggcgtagc caccttcttc gtctcacttc 

79381 cccacgacgt cgtactcgac ccgaacctcg ccgcctcctc atccgacccc gactgcatcc 

79441 catacccatc ccccagctgc tgcggcagat tctgctcctc actcttgggc acctcctgcc 

79501 ccgtcggcct cggcgccggt gtcgtcggca ctggcggcgg catcttcccc gtcgtaacat 

79561 gacctctcaa ttcatccctc ctcttcctcg cctcggccca aagccccttc aatctcgaat 

79621 cactcatcat atgattaata tcattcgcgc ccgcaatgcc atctcccagc atcagcccca 

79681 caaatgcaaa aagcgccgtc ttcaaccctg cccgaatagc ttctttcctc cacgtatcgg 

79741 gattcgccct cgcccacttc tgaatcccct caagcagcaa atcgacccgc cgcaggcccg 

79801 cttccgacga gaaatttttc ttccacgcct ccgtgttctg cgccgccttg taggtcacga 

79861 tcgcgccggc gagcacgcca accggctggc cgattcggct gtggtagagc cgcgccgtgg 

79921 agtgctcgat tgtggcgtcc gcttcggctt gtgtcaaggg acggtttgcg atgcgttggt 

79981 gttggtcgag tgtccggatc acgttcatgg ttgctactgc gttccagacg ctttggccgc 



80041 cgaagcccag gtacatcgtt atcgggtccg ggcgtttgaa gagctccggt ttgaccttta 

80101 gcgggtagaa gcgggagaat gggtcgttcg ggtctttcag gaccggggag gtatcccctg 

80161 ttgatggggg gtcggaggtg aatgacattg tggagggtgg gctcctgtgg tgtggtgttt 

80221 tggttggtgt tgtggtggtg gtaatggacg gggggagctt ggatgatgat ggaattgatg 

80281 atgcggagtg gttgtgctgt gctggtttag tcagctgcat agtgggttgt gcggacttgt 

80341 tgcttgggtc tgctggacta tctacagtag gcgtcgctgc atgttagact cgtttaacat 

80401 agttctacca gattcagtta tcagctttat ttatattggt agccatcagg agatttgtgg 

80461 gagtcaatct tttctttggg actgtggaat cgatattgta gtccacttag atactactcc 

80521 gtaccctgtg caatttcaca tccttgaaca atcaaaattg gaatactact cccaatgata 

80581 ctactctccc tattttccaa tcttctcaat caccacctct caacatacgg tctcaaatcc 

80641 aactcaaacc caaacgtcgt cctgggctgc gtatgtaaat gccaatacga atcggcaatc 

80701 tatgcaacat cagcatctca cccccaaaac aagcaaccaa acagcgaaag gaaaatgaga 

80761 taacttacag cctcaggatc taacttggca tcctccttct cgaacaccca ccccttcgtc 

80821 ctcgggatat caataacccc atcaataata acatgggaga catgaatgcc cttaggcccg 

80881 aactctctcg ccaacgactg cgccaacgcc ctaagcgcga acttgcccgc agcaaaggaa 

80941 gcgaattgcg ccgagccctt cacgctggct gtagcgccgg tgaaaatgag agtaggtggg 

81001 tatttcaaac ctccttgcga ctcagccggg aaaagtggga ggacattctg tgcgaagttg 

81061 aatgctcctt tcctatatca ccacattagc ataccatacc cagttaaaag attagagagg 

81121 ggaatgaagg tagtagcatc atacccctgc ccctcaaacc cggaactaaa ctccgtctcc 

81181 gttaactcaa gaaacggctt cctcacgaat ccaccgcccg agttaaacac tgcggctgct 

81241 agtgatgaac cctggaactg cgatttgatc tgctcaaagg cggagcggac accagtcgag 

81301 tcggataaat cggtgctgat gccggtggct tgaccgccgg aggagttgat ctcgtccacg 

81361 acgggctgat agtttgctgg gttgcgggct aggactacga tggggtagga tttggcgaag 

81421 cggcgggcga tggaggcgcc cttctcattt attagttgtt tgtttgggtt ttgtttcaag 

81481 ggatggatat ggttgatata gtatgggagt tggtggtggg aaggaggggg gaagatgtac 

81541 gtacagtgcc tgggccgacg ccggcgatga tagcgaaggg tttggaggcc attttgatta 

81601 ttggagagtt tgctttgaag caagagaaga actaattttg ggatgtttat ggaaaatgag 

81661 gggtcgaaat gggactatat agtaaatgag gaggttttca tggcgtcatt ccagtcgggg 

81721 ttcttgtcat gccgagatta gaaatccgcg atgcgatcag taatcaaggg atgggggtgt 

81781 agccgtggtt ggaattgaag tctagacttt ggaattagga agtatatagc ttgtaagaga 

81841 agagaagagt gaagtcaatt gcctgatggg gtattaagag ctattatagc tatgtggcat 

81901 atggttaaag tctgacaggt acaagtagca catggggcaa ctatacccaa attggtcgct 

81961 actgttcgga tagctctgtc tacagattgc tatcctttcc cccctcaaac agtcatggca 

82021 agcagtttac cactctcaat atatgatgta tatcattatc atccatgttt tattttgtat 

82081 tacatctcgc aagaccaggt acacatagtt tatcagcgtg aatgagatag caacgctaac 

82141 cccagtttgt cgatggggga acatcacgtg tctgtatata gagtgactgc gatgtataca 

82201 acaagttctt attgtttgag aaattttaca cattctcata aagtgtctca ttgcaagcca 

82261 gtggataata gagccaaaaa ttatcccaac atatcccgcc atgtcagtgg ttgagccttt 

82321 gtgaccttgc tgtctcggat cgtgcagcat cgactgtcta tactgccgag cgttcaatat 

82381 ggataccgac aaagcagcag aaacaacgcg tcttcacgag gaaaacgaag atccagctgt 

82441 caaagacgtc atccgcgtcg cctcatatca tcgactcgac tacgatctag cagtcatcag 

82501 aactttccct agaccccacc acctcatacg tgacttgctg acccccaaaa cccttatcat 

82561 cgacctgaat tcaagactgg gtgtcctgga tgtcttccca ctagaattca tccacgaaat 

82621 atgtctcatg cttgatgtcc aatccttgtt ccgcttccac cgagtcaacc gtcgtgctcg 

82681 acaaatagct agcgccagtg gatactacaa agcagtagtc aaacatgcca tcggggctct 

82741 gttcgttatc ctaagaacag gcatggcatc ttggtacaaa ttatcggacc tatatgacgt 

82801 cctctgcaca aagaactgtg caatctgcaa cgactttggc ggctttatct tccttccatc 

82861 gttcatgaga tgttgtttct tctgcattga ggacgacgaa cttcctcccg ttctgcctgc 

82921 ttctcaagtc agaccatacc tgaaccggcg ttcagcccct cctctctcca gtcggatccc 

82981 tatgataaag acgctgcctg gaatatactc gttggatgaa tcaaagcgaa ccagaagatt 

83041 cgaacttgta ctggaacaat cagccatctc gctcttctcg tctgaccggg ttccaacttt 

83101 cccgaagaag tactctaggg acacagagct acagcgttac atggtcagta cggccctacc 

83161 ttacctcgac aataagaacg atgtcatcga gaagggtgtc tcctgcagtg gctgccagat 

83221 ttctctggag aaggccttgc aatcgtcagg ggccgaatat gatgtctggg atgaccgaga 

83281 taaaatttat tcgcatgatc aattcatgga acattttgag acttgtccag aggctcgcaa 

83341 gctttgggaa ctaagcgaag ggggcacaat cgagcctgat atcccggcgt ttgttaagaa 

83401 tggaggctgc gctaagaaga gggagctaaa aaatgtcttt tatcattgaa tagcatgtac 

83461 aggggattgg ttgttggatg atgtggggtt ttgagaattt ctgaggcaat attcagggaa 

83521 gtaatggaat ccaggatagt aatgcaattg aaataattct ccttcagtat gataccgacc 

83581 agacatgtat gataggcgca catgcagatg atcatagcgc acatttccaa taggagactc 

83641 cccacccccg cttctctcac tgcaaagtat tgacccagtg cttagtccat actcgagatg 

83701 ctagtcaagt cttcccatta gagatagttt ctcttgaaga ttacctcttt ccatgaagac 



83761 aactgacttt ctggaatccc acccaccact atctcctaca aagcctcttc cgcactaact 

83821 caatgagccc aacccatccc tgccctatga tactctcaat ggatgtctcc agtatggatg 

83881 acagcgacta ccgaatatta atacaaaacc gggtcagata cttgacaatt cgtcccggta 

83941 cattcgaccg cgcaaccctt tccatgcctc tcagttcatt gcctgacctt ccgaaagact 

84001 gcacctggaa ttctgcgcat ctcagccgag ataaatccag cggtgaactc gtggtagaac 

84061 tacagaaacg agaccttgtt ggcatagcgg atgtctggca tccttcgcag gttgactgtc 

84121 tcaagcttag acggacaagg cagcttacag ctactacatt cgaagcaact tactgcggta 

84181 gtgagccgat aagctctcct cctctgaacg aggacactac agtaattgcc aaagttgccc 

84241 ggtttgagtg ggaaatacct cgtttatccc aagagacttc catatacaaa cagctacagg 

84301 acaccggtct tgctccacgc tttcttggcc atgtccatga gcacgggcgt gtgattggct 

84361 tcatactcga aaggctcaaa gggcgtgaag cgggcatcaa agatcttccg ctatgtcaat 

84421 ccgtgctcaa acgcctgcac gatatcggta tccttcatgg cgatgtcaac cgatataact 

84481 ttatcgttcg aggtgatgca gttcgcttga ttgactttga gagaagtcag caatccaagg 

84541 gcgcgactac gctgatgcat gctgaaatgg agagtttatg tgaacaacta ttggaggata 

84601 caggacgcgg tgcaggtttc actcggtcga atcagcagat tgagtaacag ggtattgtgt 

84661 tccatctcag tgcgatgcaa atggattttt ctcagacaat tgcttgagac cctctggtag 

84721 cggcatgttg atggatgaat agaaattgtt cctgctggag atgaatctct agatgggcaa 

84781 agtactcgct ctaggtttca aaagtgacac aacagtcagt gcagggatat gctagtacta 

84841 acttggtcaa tcaacctttc atcccacatt ttccatcgct cctctagaac cagtaggaaa 

84901 caccttcatc ataacatctc tttatctaca ggcaacccgt ttacttggtt atcccgtttc 

84961 gacggtattc atccaatcca gcgggcaata atgtaaggga tatatggcac ccgtattgga 

85021 gcagcaggca gcagataaga agatacacta ggaacttcca tttagaacca gcccgtgcat 

85081 ccaggagatc cactacgagg gaacccgaac aacgagtgtg atcaaaagcc cgggtggccc 

85141 tcttcggata tgtgacgctc acaacgtatt tgctagtact ctcactgaga acacgttagt 

85201 ccctcgatag gatttcatga atagtatccc aaacatcgcc ttcgaaataa gacttgttcc 

85261 tatctctttg ggaagaatca gggcttggtt ggatgcctag ccacatgaaa actatttgag 

85321 gggtcatcga aagtatgatt gatacctgta tttcaatatg agcttgtgga ggtagttgat 

85381 attctgtaaa gactagacat aagataaatt atattactaa ggctcaaccc ccaccatctt 

85441 catgtcactt atgacctcga cacctaaaac ggagtcatct ttttctgccc caggcaccaa 

85501 cacctcaaag ctcccttatc gataagccaa tctccgatac gaaaatccat ccatgctatc 

85561 attttttctc cgcatgcgac agacaacatc cacctcttcc tcctcaataa ataacccaaa 

85621 cccctccacc acacctcaac ctaatacttt agtctctcat accacagaag aaagaagagc 

85681 aatgtccacc tccacccccg actcccccct ctccgacctc tgctcaatct ggtccgtcca 

85741 acatcctccc atctcatccc aaaccattct cccaatccac accaactaac cacaaacctc 

85801 ttgttgtcaa aaaataacag ccacctacaa ccccccaaat accgctgtcc gcgctgctcg 

85861 acccgcacat gctccctccc ctgcacgcgg cgccacaaac tctggtccca atgctccggg 

85921 gtccgggacc cagctgccta cctacgacga tctgaactag caacggaatc cgcctttgac 

85981 cgggacttta acttcattac ggggattgag cgctcgattg agcgggcggg acgggacgcc 

86041 gagaataggg ggattgatgt tgcttcggag gtaagggatt tggaggcggt ggggttggat 

86101 agtgaggata cggagggggt tgccgctatg gggaagaagg gtggtgctgg tgctgggggg 

86161 aagagaaaga ggggagatgg tggtgcaggt ggtgggacga gggctgagat ggagtttttg 

86221 agacgcgcgg aggagagagg ggttaaggtt gttagagcgc cgaaggggat gagtaggaat 

86281 aaggggaatg ggagtcggtg gttgggaagg tatgtgtatc tctatttctt tgcttgattt 

86341 ggggtgattg tttggtgggg ttgggatgaa cgctaactgg gtggtatgta ggaatcagtg 

86401 tttggtttgg acggtcgagt ggattctgag tgatggggag aagaggatga gaaattgttc 

86461 ggagacggct cctgttgcgg attcttatga ccgcgtggtt cctgttccca aggaggaggg 

86521 gagtgatgag cagggtcaga aacagactgg ggatgagaag ccggagggga cgtccgagtt 

86581 gaagacggag gagcaacaac cacaggatgc tccctcgact acagcccccg agccagcctc 

86641 gactacagaa cctacacagc cgacagacgc accaactgct caaccggaag cttcaaatgc 

86701 gacaccaatt acatcggact cgctaccact taccccccac cgtgacctct acttctatct 

86761 tcatcgtccg cggacagcta cgaaacagcc tgttctggcc cccctgcccc caaagatctc 

86821 acttgctgct gctctacgtg atcacatagt cttggagttc cctactatct acgtactgcc 

86881 ggactcacct gatactatac gagcccagga agatgccaag tacatcctag aggaagagta 

86941 cctacgcacg catcagtcgc cggaggagag cagcgatgat acgggggatg cagacgacgc 

87001 tcaattgcct ccgggtgcca ttgacctccg aggcgtcgac gagtccaagg ttctggaagt 

87061 cctccagaag gatctgttcg agccttcggc agcagcgtag ggccatctag acttggctgg 

87121 agcatcatga cagagaagga ggaggggatc attcacttgt acctaatact tgttcatatc 

87181 atagcgacaa ctggagagcc gggtggaacc gtgctttact ttggcatctg tactaggaag 

87241 aatcatctat gatcggctga tgcacttcaa ggcagtctga agcggcgaag tcagcgaact 

87301 cgatgcgcat acgtagttac tcgaagtccc gacccgttcg tcccggatcg agctatgctt 

87361 ggtcaataat cactggttaa ttgtgcactc aaaatcccaa gtatgtacgc taaatccaat 

87421 atccaaacat ggggtatcac atgaaccaaa atcaaactta ggacgccatt tatgctttgt 



87481 cgggtatcaa cgtccgtgac ggctttagat agccaggata tgacagattg tcgatatatg 

87541 tcaaacgctc acttttggcg agcaactgtc gtctctgcct ttcaggatat ttcatcggag 

87601 gttccagcag gctccgcagc ctcgatcaat cgatggcgcg acataagcct ggcgcggaac 

87661 ggcgttccaa gaatacccga gcgtctgcta tgcagcggca aggcggttta gacgacgtga 
87721 ctgacgtatc gacgtattga gagatatcac aactgcagtg tgtatcccat ggggctcaag 
87781 tcagcggact cgaagcatca gacgcggtgt ctggaagctc gtgatctccg attttctcgg 
87841 catgatcaga ggaaatagat aatacttcag taggaatcgt attcggcaga gttggttcat 
87901 cctcaggtgc agccttgctg ttgcgagagc cagctttcga ctcggcggct ctagactcag 
87961 tgggcgacaa attcgtccgc tccgatctgt tggatcgaag tcggattcgg cctttctcca 
88021 aggctcgaag caatgtgcca acttcgtccg atacgatccc acgaagcttg ctcattgtgt 
88081 cgtctgacac atcagagtag gcattccagc caagcttatc agactcttca gaggtcaaag 
88141 acttagactc cacagagggt ccagtgatga gccttttcag cgtcgtaggt gacacggttg 
88201 gccggcgtgc cattttgacc ggcttctcct tgctcgaagt agccgaaata gcctcttcct 
88261 gctggcgttt aggtgggcga gtaatagaag cattcgggcc ttcgcccact tcgtttcgaa 
88321 tcttataata taccaacccc aaggtcgcgg ccagcgcctg gtgtgcggcc tcggggtata 
88381 ggcgtacggc ttctgcgcga cttaggttgg acggataatc actggaatca ctggaatcgc 
88441 tatcactggc aaagagcgtc tgctcttgat gcatgcggcc ccaatactcg tatcctccga 
88501 gctgatagaa ctcggccagt tgcgccaatc gattgactga ttgatgttag atgcttcttg 
88561 ttgtgaggca catgctttct gcacttactc ctgcttcgct tcttcaaaca ccagaacctc 
88621 caaacagtgc gagggaagta ttcgggccat tgcaaggacc cgtcttcctc gactacagga 
88681 acaggcgtga tcggggcgtg tgtcgtgtgg gccaacgagt tgaaacggac tcgttcagaa 
88741 tgcttcatcc gagcttcgat gagagagaat cgattgaaca cagaagtcaa atgccgcgat 
88801 gtctcatgtt gcagggtaat cagatcagtt cgaagttggc tgatggtcgc atgtagtttg 
88861 ctgatctcca cgcggcatcc gcatacgtca gtctgcaggt gctcaacatc tgactggatc 
88921 tgttccacgt tcttcgacaa gcggtcgaca tcactgcgaa ggaagacggt ctcattgccg 
88981 ctttggccga tactggactt gagttgggat atccctgact ttaactggtc gacgtctgtc 
89041 ttcaacgcca cgacattgag atcgattcgg ctggaaattg accggtgtgc ttcgaattcg 
89101 cggcgaatgt gttcgtcttc gcggtttcgt cggtcaacgt agccatcttt agtcagaacg 
89161 ctggagcgta gctcgagtag gcgggagtcc agaaaagtca ggcggtcatg gatgtgtgat 
89221 agggacggat ctgctaaggc tggaggcgtg ttttcgtgga tgagagataa tcgggggctt 
89281 gagacaggga tggacatgct cgacgtgcga gagaggctgc gttcgagtgt tgtggtcatg 
89341 atgggcgaaa ggggggagac ggagaaatgg tataactgcg tgggttgata tacaaaagca 
89401 cgcatagaac gctttaggcg ggtatcagca acattttgcc tgagttttga gatgacagcg 
89461 agatggttgt ggtggagttc gatgacggtt gttggcggtt gaaaagcgat ggctggggcg 
89521 ttgcaagagg gaaaagagac gagcgaccga cgagagagaa ggggttccag gcagatcccg 
89581 cccctcgctg gctgccttcc ttcgtggcac ccgcttcacc caaccgaatt catcccagcg 
89641 cctgaccatt aacacctaca gttgcacaca tgggaattgc catcgtagca gcgcttctat 
89701 tattgtattt ctcaatctaa acaggtatac ttatggcata cttttctatt ccgttgtatc 
89761 tccctcatag tatggttaat cgctgaggat gttatcgcga gccagtcatg acaatacatg 
89821 gtacactcca tgacaatacg gatcacaaca tatcggcggc ttggcagaca cactgcattc 
89881 agctgaccca aattaaccaa tcaccattgt tcctactcat ctcgccaccc gaattggaga 
89941 gttgcatcac ccgttgattg gacaagatga ataactacgg gcagggcttc agcgtcatgg 
90001 tcgtcagctt cgtatcaatg accgtattgg tcatattgta tacgcgatga aaacggttgc 
90061 tcccaaggcc tcatttcctc aaccagacgg gcggggctcg cttatcatag attgagtgca 
90121 tcgctgactt cagattcggc agtaatacag aacattcgag cagcaatctc agatatatcg 
90181 agctcagccg atgctcctcc ggggaaaatg ccatcaggaa tgaggcatct tgcccagata 
90241 aagcgtctca ggagtataaa gagagatact aatcagcgcc attgtcctca taagtgctaa 

90301 tgagggagtc ctatactgat ggatgtctgg atttgcttcg tcgcaagaag gaaatgtgca 
90361 cgtcgctaac caaagaacgt ctgtagtttc tcctccctca tcacatccaa ccaactaacc 
90421 ggggagcaag cccagtgctt cttgaactta tactatccgc gcgcatccat actccttcac 
90481 actaaactcc tcgcatggtc caccgattca agaccaagta gccgcggtcg acgacggaat 
90541 aagggccgaa agccaatgcc gcgagtcgcc gccagtgggc cgcatgacag cgccatagag 
90601 tgaagtgcaa acagcgaaga gcatgacttc tagagcgagg gcccgagaaa aggcgagata 
90661 agctaacttg gacagagaat aatggcacct ttgtaagccg aagtcattgg caaacaggtc 
90721 cattatctcc atctcagtgg gctatagtgg aacaaactcc agtatacatt ggagttcttg 
90781 cggagttcta agcggcagcg cttaacccgt ttcgtttaag ataattatat tataattata 
90841 atagggatct tctgagatag tataattata atataattat acacgagatc gttagcggtc 
90901 gcacttaagg aggttgcaat gagaattcat tccgtttccc tcttcctcaa tgaccatcca 
90961 tcaaatcaag actggccctt caattaacaa taatataccg catcgcctac tatttaatac 
91021 atctttgcat ccttaatcac ggagtcacaa tgccagcgtc tgacaaaaag aaaagggctc 
91081 ttaagattgc ctttctgacg ctcctgtggg ttcactttgc cggaaaatta gacgaaataa 
91141 gtggagaggc ggacgattca gatgcagacg aagatgaaac aatcgctctg accgatgcaa 



91201 gcgacgagat gatgatcaag gccatggagg ctgcgacgcg attactggat gaatgaataa 

91261 cacttggatt agggacatag ggactggttt gctgaagata ctgtcgccta taccaggtaa 

91321 gaggttgtga atatagtccg aattatttca acgcgcgtct cgcgacgcga cgctgacaat 

91381 tccgctcatt gacacgaaca ataggtttcc cgttacgaat tcaatgagat cggaccatta 

91441 tgatttagac cattcgagta tactgagagg ctttatgaag cttggacctt gtgggcattg 

91501 gttggttgtt agggggcggc aattgtggat ggttcgcgat gactgcagac agtgtgtgtt 

91561 tagcagagtt ccaattaatg agcaacgaat ggaaatcggc tgatcgcctg gttatatggc 

91621 tacccatgcc acaaacatcc tctaccccag ataataaacc cacaattgac caacaagaag 

91681 aatcctcaag caagatttcc ggtcgacctc gcggtttatg atccccatgt tcccaccgac 

91741 ctccccagga tgagggttgt gcgtgcaaac ggtggtgtgg cagaaagtga ttgcgggggg 

91801 gacgcatctc catacaaagt gggccacttt gtatacttct cgttcccagc agcatagttc 

91861 cctccacaaa tgaacccatc aaggatcaac tcactcccaa gggtgtggta aaataccata 

91921 ctacatctat ttcattatcc agacccaaca actctcaatt tgatggaagt gactgccatg 

91981 acgcaccttc catgtcaaca tatcctatca tcaaatcacc aacatcatca ttctctcaac 

92041 catcctcatc atacaactac taccactatc cctaacaacc ccctcctaca tatatcctca 

92101 cacctattcc tactattcat accaccactt ctcaaaatga aactccttct caccactctc 

92161 atcctcccat ccctcctctc cacccacttg ctcttcgcaa ccgcaatatc cataccatca 

92221 ccccaagaaa cccaccccga cgccaactgc gactgctaca tcacttccag cttcacctcc 

92281 aactccaacc ttacagcccc aacgtacttc acgcactaca aattctggga cttccgcaac 

92341 gcaccccttc cccaaagcgg ccccgaaaca acccaaacca acccgtccaa cccgcacgac 

92401 acctccacgt tggaaatcta ccccctaacc aaaaccctct tctcgcacga ctggcgcgcc 

92461 caaacctggg accgcgaagc aacagacatc agccctgtgc cgatgaaaaa caccgacggg 

92521 aatgtcttcc tgctcaagca cccatccagc acacccaggg accctcgctc atcattccta 

92581 gtgctccgca ccactcgatt ccaggaccac gcatcaacag ccgaaatgga aggtctattc 

92641 ggacacatat accactgttc gctacgacta cgaatgagac tgatgtctcg agacgatatc 

92701 gtaacgacag cagcatccac acaccctaat aaccaaccag gttcgcaacc cggatcacaa 

92761 tccaaaatcc cccaaggcag ctgcgccggc attttcacct accggtcgtc gatctgcgaa 

92821 tccgacatcg aaatactgac caccgaagac agccacacag tccattacgc caatcaacca 

92881 gactacgatc ctatctcgga taccttgatc ccaggggcaa gttcgatcgt gaatctgacc 

92941 aagccgtgga cgtcgtggac gacgcatcga ctggactggg tggctgatat gtcggcctgg 

93001 tatgcggatg ggcagttaca gagtaagagt aactattcgg tgccggatcg gccgagtatc 

93061 gtggcgatga atctgtggag taatgggggc aactggtcag gggatatgaa ggtcggagag 

93121 agtgtgttca tggggattga gtggattgag ttggtgtata atgttactaa ggctgatcag 

93181 gatcttccta ttaaagatca tccggggcat cgggatcggt atcatcgggt tggagggctt 

93241 ggttctgctt ctgcttcttt ttcttcttcg gcttcttctg tgggtgggga tttggtggat 

93301 gaggaagatg tcgagcggca gatggggaag gctaaggaga agagcgatgg gtgtcagaga 

93361 ccgtgctgga ttgatccgtg gcatcctgtg taggggttgt tgtgtcgtgt tgtgttgagt 

93421 tgtgtttcta ttctttattc agtgtcatag attcctatcc taattctaca cctggaatct 

93481 tgcgtcattc acagatcgat cgtagctgcg tatgtagcta aaatgtttct tgagtatgta 

93541 ccttcgtaag tacaaatggt cgaaccgaag gggctgtaca gtaaattgga ggcatgactg 

93601 aatacataac catccgctgt tcctctcagt cttcaaatct catgcgccgt gcttgatgcc 

93661 caatgcctat gccaaacgtc caatcccaag acaaatccca aaacacgcac caaaacaaag 

93721 ccgaaaacat gcatgccagg agctgtggct cgactgacgg ggtatccata caagacaatg 

93781 cgaaacgctt ataagcaaac ccaaaaaaaa aaaaatgcaa acgatgccct ttcctcctct 

93841 tccttcttgc ctagaaacct ggacctgttt gtggttagaa cgcccaagcc aacaagcaga 

93901 gaccaaacac caacgtccag gaaacaaaac aagcgcaaag tgagtacttg cagagcaagc 

93961 accgaaacat gtagatacaa gatcaaggaa agaggaaaca aagaaactga cgcttcgtac 

94021 ctttgccatg aaattccctc ctcttgtcgc tttttctttt ctttttcttc tttgttttgg 

94081 tcctctcccc aagagggagg ggagggccgc cgtccccgtc cccgatcata gcaagacatg 

94141 gtcttgctgg gcggggtcaa attccccctc caggccgccg gccatcagct cggccactat 

94201 cttacggcgc aagagcggtg tgtcttgcgc gccgtaagac agtggctcga ggccgatggc 

94261 cacagccttg gccgtggtga ggaggaagac cccgcaatct gaaccattgt cttgctgggg 

94321 ggacagggac ggcaacacgc gccactcctc ctcgacgtac ttggggccca gctcgccgcg 

94381 gagccaagtt tggactaccg cgatgtggcg gcgagaccgg gccccaagag aatcgaaatg 

94441 ctcgattgag cgttcccccg gtcgcacaac aaccagcgtc cagtgctggc tgttgtggac 

94501 cgggatatag acggtgtcta cgtcgaggag gagtgagccg ccgatcttgg cccgtttggc 

94561 ccaccgggcc acggactcgt agcctttgtc tctgagattc gaaaagaaaa acgaattgaa 

94621 ggcatggaat cggggcttgt cgttgcggcc ggcgttgtgg ttcttgcgtc ggaggtaatc 

94681 aacgatgagg gcaaggtatc cgttgatcac ctcgtcgtta agccactcgc cgcgcgtgta 

94741 acaggtggcg aggctcctct tcgttagagg gtcaccggag agggtggtcg cgacggtttt 

94801 tcctccggat acgcgcccat cttgaagttt ttttttattt gagcttccca ttcgggagag 

94861 agaggccgta cagctggacc atccggttta atacgtgatg ggagcgcttt ctgcctctct 



94921 tcttcttgga tctgtctcca acggtcggcg tacacagact cggttctgcc ctctggaatg 

94981 ggttttggct cgggataaaa tagactgaca gctgatactg ctcgacccag cgggaactca 

95041 aaggttggtt gggaccaggg gtcaacgatg ggctcattct cttcctcgtt ctccagggct 

95101 ggcggcacgt tttccttttc tgtgtactcg gctttcgtgc caacactggt ttcggtagtt 

95161 ctctcattga catcaagcgt atctccgaga tggggtgcca ggttgtcctc cacctcaccc 

95221 caatctcgcg tgtcgtcaat gaactcggct accacaggct cgcggaagcg aaccctgcgc 

95281 ggagtggcgc ggcgagatct agttctattc ccagaaatgc tgggccttcc tcttatcttg 

95341 agaattgtct tgggtgcaac agtgattgat gatgagagaa ccttttcagt tgcacccagt 

95401 tgatcaactg cagtaaacga gcgagactga acattcgcca acatgttcaa gtcttgagct 

95461 ctgtctagtt catagctgct tctcatcaag tttagttcaa ccggacggcg acctgcggcg 

95521 gaggcttgac ggtcacgcgc ggcctgagcc tccatagctt tcttcatgcc ggcgatgtcc 

95581 ccaacgaaga gggctctgaa cgcagcagat cgctcacggt tctcggcgcg ctcagcacgc 

95641 gtaatgtgct tttgcttgcg aatacccggg cttccttgcc gacggagagg cttttgggct 

95701 gtgtcttgag ttgaacgttg agctgaacgc tgagccgagc gtggagggga tctccgagct 

95761 ggctgtttca ctggcggcat cccagggtgt ttgcgagttc ccacattctt tggctcatga 

95821 gcacgtcgct gaacagatac agagtcactg agtgcctggg gcatctgctc ctgagacaca 

95881 aggtcctgca aaatctgaga agactcgact gtaggagccg tgttgttgtt atgggagaaa 

95941 ttaggaacgt tggagaagtc ggggaaagat ttgtcggtac gggggctgga ttgtccgctt 

96001 cggattctcc tatcacgatc cacacgctcc tgctgttccg gagaaaggtt gcgaaggttg 

96061 aggcgcgtgg gcgatccagg aggcgaagga agacgacgtc caccgcggcg attccaaagt 

96121 gcgcacacgc gtcgcctagc agcaccaacc gcattcgcag cggaccttgc cgcataggaa 

96181 aagttctgag taattccagt gggaatggag acaactgtct gcaagtagct atgccacgtc 

96241 ccccagtaag cataccaagg tgaggaaggc tgtggttcgg cgtccgaatt cacgggaata 

96301 gtcggctctc cagcgtcggc catgtccgta ttgttggata ggatctcttc atcggaaatc 

96361 atagggctat cttcgatatc ggtggcggca tcggtggacc agtgagacgg cactgattcg 

96421 ggccacgctc caggcatcat atggttctct ggagtgctag gggtggtcga gaccgcaaga 

96481 gcggtgacag gcttgacttt ggaagcgtcc agtcgacgcg acttaggaga ggtttcagga 

96541 agagcgagat gagattcaac tgccgcaagg gctgtaacca tgtttggttc gttgttgtgt 

96601 tcaggagtgt cgatgtccat gggggaacca gggtaccaat catcttcgtt ggcagtggcc 

96661 tcaagttcag tagcagcact gcgtgcaaag ctaccagccc tcacacgacg gaatttagca 

96721 gaaggtgaag tgatgggttg agcttgctca tagccatcac cttccattga tctcttgcga 

96781 gaaggcgtag taaagccaga atgtttattg acattcatat gaaaagtttt cttgtctata 

96841 ccaatagact tgcctccaag aaggtaaccg gtttcatttg acctgtctgg accaaaaggg 

96901 aggctgagag gtttgttatt cgaccaggag acgtggttca ggccggcaga tagatatttg 

96961 ctggtaccca gatggcttaa aagatgagat gaatgcgcat tacgaccgct cttctggagg 

97021 gcatcaacgc cgaggaatga ggccgcgcga ctattcgggg agaaacgatg atcgggtctg 

97081 atgagcgggg tgcggagggg gaaggagggc ttgacttcat gttttccatt gagtctcaca 

97141 gcctttggat tgccgggaac atgcacggat cggagctgat actttggaat ctctgcggag 

97201 ggggctttgg cagtttcctg agggatgtta gatgcggggg aatcgggagc acaatccatg 

97261 gggtcatggg cggtattatc gacggcctcc tcgtggggcg ccgaatctga gggcatatag 

97321 catacgtcag gcattgggtg gtcctccacg ctgggtctca tagtcgagaa aaccccagac 

97381 atggtgagaa gtaatctaga aacaagttct ttgggtgaaa gaagtggtga aagaagcggt 

97441 acgagttcga tcgcgaggaa tggaaagttg ctgtattcgc taagctcgta aacgcgtctg 

97501 aaggtaatgg gaagatgggt gataggagag aagggaaaga agaagaagaa gaagaagaag 

97561 agaaggaaga gcagggtgaa gaagaaaaca agagcggcgc cggggtgagc gaaagggaag 

97621 gaagaaccaa aacaaatgta ttcactgagg gcagccatgg gcgactgcca tagtgatggt 

97681 ggaagtcgcg cgcgtcgccg cagtgaattg gagctccata ggaacgcgcg gggggagcgc 

97741 ggggtgggcg aggggcgagg aaggcgcaaa gacggagggg aggcgatgaa gccgcgagaa 

97801 gttgaggaag agacgaagca agtaatggag cagcagcaga tcagttcaac ttccgaggga 

97861 gcttggcggc cacttttcct catccagcat gtcacgtggt gtgtgccggc tggcggatgg 

97921 actatgcatt tgctgttcac tttctgcggg tattcccacc aaacagtgaa catgcgatcc 

97981 atggttcctc ctcaactcgt catatcttca ttctctagga ctcggtatag gatatcagta 

98041 cagtgcttag actattgact actcttcctg ttctgcctat acacatatat cacactattc 

98101 atccacatcc acatccacat ccatcggcaa cttggctcaa cttccacaat tagctaccag 

98161 tagatcgcta ctagtccgtt ctcgttcctt atcaaggatc aataatagcc aactagacta 

98221 aaccaccacc ttaaaatttt agactcccta ctagaaatac tactactagt atcaactctc 

98281 attcgcctcc actaatggct aaatatccca ccttttaaga acaaatagaa aagagatcat 

98341 caacggtgta tgtacaagaa cggcagctat gttatacacc aaggactttg gttagtccag 

98401 cccgctgaac cgtagagaat atagcaaaag gtttacagaa ttcgaaatgg gtgatggccg 

98461 tcggacccac ccagcgacta ctaagctact atggcgccag ccctatccga tcgaagaacc 

98521 aaggcgctcc ctccatcaca atccaggcaa tgcctagtca aacacagtca gcatccgcaa 

98581 ctttataact cgacgcacag gggttgacct accggaatat acaacaagtt tcgttacaac 



98641 ctctacatca tagcgcaccc ggggctgttt gatctgcgaa aacacttcct tctcgtcggc 

98701 ttcaatttcc gactcaaacg gaggaaaggg cacggtcgaa gaagcttctt gtgagctaac 

98761 atcgacagaa ctcgccacag caggggcacc tgtgcccatc atattctcat actcgtgtaa 

98821 cttagacggc tttcgcgaca gtttccgtgg ctcggactcc gaaaccggtc caccttcttc 

98881 cggcactgga gatgcttggt tggctgttga aagactttcc cgtctccgtt tctcgcgata 

98941 ggccagtgtc tcatacgcat cagcttctga ctgcggaccg atcgaaatag cgcgtcgtcg 

99001 gggatgccgt atcgtggtta taattgaagg tatctccgaa aagcccggca gcacttcgtg 

99061 gtcccggagc tgggatggga ctttcgtgta ttgcctgttt tgctcgttac gttagctgtg 

99121 ctctattttg gcagacactc ttcgtcttct ttttctgttc aagggcgcgc aagtagactt 

99181 acgaggcatt cctaaagaat ctccggggaa gtatgaggcc cagcttttcc aggccccgga 

99241 agacgggagg caaagcccgg agcaaaagag gtttggtgac ggctcgccac gtgaagagca 

99301 taagaacacc aatgacgagg cggagagacg tcttgaccag tcctacttgc cgaaattcaa 

99361 atggaacagt ggccggcaca ggttcatccc acactaggct taacctggct atatgccaat 

99421 aagccacttg tgctccaagc atgactcctg caaaggccac actatcgtcg aagcacgggc 

99481 agtcatcggc aggctctggg tgtatacgaa ccaaccccaa aatcaagaga atcactagtg 

99541 tagcccccct tccggtagca gaaaggacat actcatcaat ggtcggtcca agcgtatact 

99601 gtaggaaggc gaggaggaac ccgagcaaac atccaatgac aacatcgaag aagccatgca 

99661 taccacagta cagacgaccc agaacaatgg aacagacgta gaggtatgtt atggcttgga 

99721 gagcgatgtt tacctggggg ctcagggtgg attcggaaga gttcaatagt gcgagagcgt 

99781 acgccgcgac cgagaccgca ttggtggaat gagtcgaagg aaatccgtat tccaacgccg 

99841 cagagccgga catcgttatc cgttggagag gaggggatag aggtcgcggt aggcatagca 

99901 agtccttgat gaaaccactg aaaaacacgc cggaagccaa aagatgcacc attctgagaa 

99961 aatgagtcgt cagctcgatt gcaggccaga taatagtttt ccctctgttt aagggctcgt 

100021 catgggcacc tacccgcgcc ccaagctcgt atacccgcac caaaacaaga tcggcaggaa 

100081 aaccatgaaa aaggtatggg tgcccaggtt ggccgtgaag gcaaagtagg agtccagggc 

100141 cggagtacga acccgctcct ggacccaagc cagatacggt gtctcatacc gaatcagcgg 

100201 gaggagcttc tgtcgtggcc agtaacgcca ataaggaagc tggtttcgat ctgataccaa 

100261 ttgacgagca gtcgcggtta gtcctctgaa gacaattgct attccacatc ggcaccacat 

100321 acaatgatcg aggcttctca gcccagcgtc cggctggcca ttcttcccat tgaccatgct 

100381 gtcgtagttg ttcaggtcaa ggcagccgat tcaatttagc gtgatggtat gtttgcgaca 

100441 accaaccccc cataaggttc ccgacagata gcagtctcaa cacccacaac gggctgataa 

100501 gatcattgag cactagagtg gttggagagg aagcccgtgc tgttcggggg gccacttgct 

100561 atcaggggga gaaaaggaga gcagcagatc gcggtaggct aagtcatgct catccggcgt 

100621 tatcgcagaa aaggtcgtgt tcttggcggt cgatgcggcc gtgatttgtt caaccactca 

100681 accaacactc ggtccttacg tttcccccgg ctgagctaag gttttagtca cccaaacacg 

100741 gcagcgaaca tgatcactct cgaaaaataa ggtgatctat ctagaatcgg atctacttac 

100801 agccaccacg gggctgtcac ttttggagct gcgaggttaa ttagaatagt ctggttacag 

100861 catgagaaaa tctggttttg gtatgctatt ctgtatagtt ttcatggtgg cttaggattt 

100921 ggctcagtct tctagtagcc gttttaagta ctatctagtg gttcgaatat ataaaatcta 

100981 ctactagtaa cgtcacgctg cctaagtaaa tttgcaacca ctagaaactg aaccgcgtcg 

101041 agtcatcagt actacgaagc atactaatta tctgtaaggg agcagctttg cgatcaaaaa 

101101 caatactcct ttggccgaaa atagtaatgc agagtttgcg tgcatgcgac ggatcccagg 

101161 cgttcctgtc gggacggtgt ttctagtcgg tgcttatgca ctcactcgac cgcagatcaa 

101221 tgcagcaata tatatatagt agtagtaagt actccgtagt taagtactac taaacaactg 

101281 cgctgtggta aacttccgga ccaaatacga tttctttgat tcatgatata ttctttcaag 

101341 tctgtaggat actactagta gagactaata gctaggaaat gctagagcat tgacaattga 

101401 catcgtgtcc tagtaactac tactactacc cggactgaca cgtcccatcc gtagtgtacg 

101461 atatctgcga ctctgatcta gacttatgtt gtagcctagc ccgtgcccat agtgttgagg 

101521 ggcactgcac tagattgttg ccccctcctc ttcagcacag cttggcatgg tcctgagcta 

101581 gaccgtttgg acgtctaaaa gtctttccat gtggctcagt attggtttcc ttgtgtgcca 

101641 atggaaagct gacaatgaca gcgacgcgtg gaatgagtcc agccgggata ggaatttgca 

101701 ggcatcggtg gcggaggcta gaaggccgtc agaaatggag ctactatgat ggctataatt 

101761 ttggcaccat aaggagtgaa attcctagca gataggtcgc aatgcgttca gcagcaccta 

101821 cgaggaattc gctccatcga gccggagaca tggaacagcc tagccaacat gtctgagtaa 

101881 acagaggagt atctaactac gcagtaatgg aaatactaaa cgggtaggaa taattgacca 

101941 ccaaacaatt gcctgtagta gtgattaacc tcggcaggaa caggaattaa tcagttcctc 

102001 ggggccaaat tcaaggctga acgtgacatc ggaagatatt acgtcaaccg ggtcttctgt 

102061 cgttgctact ctggattgct gggtcaaaag ctgtttcagg gacctggacc gtagacgaag 

102121 acagtgtagc acgggcctgc aagggatcgc accagaagga aaccttgatc aatgttatcg 

102181 ataatcgggc ccaaccctgc cactaaagcc cgaatgtgat tcgaggatcg ggaacgattc 

102241 caaggaggac ggcgatagag tggtggtagc ccggctcggg ggcccggcca tcgtctagga 

102301 attcctggtg agcaactcca ggtaaacaac caaggaaccg actgaacatc ggctgaatct 



102361 gctaaaaata acaagagcgg atgaggcaga agacgaatgg aatcaatgga tatgcgtcca 

102421 gctgaaggtt gcgtggagtg acaagcggag ctcgtgcgtc atttgtcccg cggcgttgcc 

102481 atggcagccc atcggccggt aaacatcggt cttcgcggtt tcatgtcagc cacgacggcc 

102541 tcgttgttgg agcagccgtg ggaggaggct ggctggctgg ctggctgggc tcatcgtgga 

102601 gaaactagca ggagcatctc gagcctgaca tccctttctt cccttcagaa cctcttcgtg 

102661 ctggtttcgc gtcttgtttt cctaaatcca ccaacctcct ccccctcctc acctcccaaa 

102721 taaccaccca ccaccccccc cccctccatt aaccttttgt acagaaagcc actttacttc 

102781 gacattccta acaacgcagg ggaattgctc aacacacacc gacccagaaa actttccgct 

102841 gtgcttttga caagacagtg gatcgttgag cagcaattat taccgagccg catttcctga 

102901 ccgacaattc ccaccccttc cggaacggca ctgtatactc acattcctcg aaattctcct 

102961 actccgccat catgtcaggt acaatgtacg acgagcacgt ttatgcaccc tcacggcgcc 

103021 attcagttgc tactcctccc ccatccatga ctccacgaca tagtcggacc cgcagtcaaa 

103081 gtgtcagagt tagcaacggc accgtgagca caaatacaag cgtttcgagt gggaggatgt 

103141 cggaagctac aaacattaca catccacctg cgtattccaa gaagtttgtt gtggttggtg 

103201 atggtggatg tggaaagacg tgccttttga tcagttattc ccaaggttat ttcccagagg 

103261 taggttccaa tcatgtctgc gaacttcccg tatcgttgct aaactaatca ctgtgtattt 

103321 agaaatacgt tcctactgtg ttcgagaact acatcacaca aaccacgcac cgagcctcag 

103381 gcaagacggt tgaactagcc ctctgggata ctgctggaca ggaagaatat gaccggctgc 

103441 gtccactttc atatccggag acggacctgt tgtttgtttg ctttgcgata gattgccctg 

103501 catctctgga aaacgttatg gacaaggtaa ttcccaaatt cccagccgtt ccgcgaccag 

103561 ttatagtagc taataccggc cgcagtggta tcccgaagtt ttgcactttt gtcccactac 

103621 cccgctcatt ctggttgggc tgaagtcaga tttgcgctcg aagagaacgt gtattgaact 

103681 tctaaagacc cagggtctaa cgcctgtgac gccggaacaa ggtgaggcgg tggcgcgcag 

103741 gatgaatgcc tcttacatag agtgtagtag caaagaaatg cgcggcgtcg atggcgtctt 

103801 ctccctggca gtcgataccg ttgtctccgc cgaggaacaa gactggaatg accgccaaac 

103861 gtctactact cccggaagca aacctaaagt cggtggtggt ggcggaggaa agaaggtcaa 

103921 aaagcgttcc tgcaagattc tgtagacttg ggtctccttt ttgatgtttg tctggtcatg 

103981 atcatctgcc ccttttggtt tcgcctattc ttcctcttgc gatgtctgtt tttccgccct 

104041 tcctgttgta ccgaatgaga agcctggctg cttcgacggt ttcccacgct ttttgtatgt 

104101 gacatgacat gacatgacca agaggccgtg aagatgaaac acactgccac gatgcacctg 

104161 tgtctcctac tatgccctct acctaccctc ttcacctgct atacatgccg gcggcttctt 

104221 gaatttttgc tccctccatg ttgctatacc tgttcgaccc tgattcccgt ttccctcaat 

104281 taccattact cttatcatca tatccttccc caccccttcg acttggcgat tgtttttctc 

104341 tccctgtgtc tgaatgcttt atgttgtcac gaatttggtc cgatggccgc tcttttgcca 

104401 tgcatgggat gggtgtttac tggcgcaagc gaaaatcccg tgtacttgaa ttgttgaact 

104461 tgttttgttg tcaaactcat aaactctgtt attgccccca tatggtatcc gtatttcgat 

104521 atttccgata gcaactagag taaagcaaaa atggaaaaac gtaacaaaaa tagctactac 

104581 aacagtaaaa gcagaagtaa aaacaaaaca actcctctac catacgccac caatcaacca 

104641 atcgaatccg tacccacaat aacaacccga gcctcaaaac taaatcatag acccatctat 

104701 ccatcttcac cgttcaccgg tcctcctcct ggagctgact ctgctcctca tcctgatgat 

104761 gcccaccccc actaccacta gcaccagcct tctgcttccc accatcccca gcctcattct 

104821 cctccccagg ccgacccata aagttctcga actgggacca cagcgcttcc acagagccaa 

104881 actcattccc aacctgcaga tataatatca gcttagtaca taataggcgg gacctgatcg 

104941 gatctgatct gagctgatcg acatgcgcgg aatggatcgg agaaagaata agtggggggt 

105001 gggcgagtga ttacccgtct gtttgggtca tgcacttacg gaaatgatgc tctctaggtt 

105061 ccggttcagc cggttgatgt tttgaaggac ttgttccatg ccctggaagt ttgattattg 

105121 attagatggg aattgggaag taggttgcaa ggagtgcata gtatagataa tagggaggta 

105181 gggaaatact caccactgcg atctccctaa ccaattctgc ccgctgctgc tcgaagactg 

105241 ttgggctgga ggatctgttt ctgggaattg gtgttgttga cattgtcgat gggtgtatgt 

105301 gggcgtgtag tttaggttct ggattgggtg gaaggaatca gtgacaagag agagaagata 

105361 ttggtaagtg ttgctaaaag gcttagcgtc tgatgacttt agtctttctc gtccttgtat 

105421 gatattgctt gatagcacgc gtgttgattg ctttcgattg tgattgtcgg ttgtggctcg 

105481 aggcggggcg gttcctgtct ggaattgcct gattcggttt ggtctggtct attttgcccg 

105541 cagtggcagt taccgccttg gtggccgcct tggggcttat cgataaggga ttaatggtat 

105601 ggatatgagt ggagaagtgc attggcaatg cgttgggcga aggtttggga atctgggtag 

105661 tattcgatat ctattcatat gcattgatat aattgtcagg gttcacgact ctactgtatt 

105721 cgagcgtact tacccatcgg agctaatcta tggcttgaaa aagagttgcc gcaacagcag 

105781 cacgcaaatg ccatcttgct cgagcattac aactaacgat catacgtggg agcgcactca 

105841 ggctcgtatg tgatctcctc actgacacca tgatatctac cctaatttga gtccagatga 

105901 attcaaagaa aatggcgcag atgttccaca gtccactccc aaaccttcat cgacttgacg 

105961 ttcctctgac gtaacgcgtc gccaagtgtc tatataacat ggaaagaata tcagacaata 

106021 atgttaaata cagatgaact tactcgtccc atccgatcac aatataaaga tgaaggcaat 



106081 cggcgtggag aactacggtc ctgttgaccg tcttgaagct tgggaagtcc cgaagccgac 

106141 agatcttcag cctcgggatc ttctggttcg gtaggatagc ttcatttttc taattccatg 

106201 tagtagtcac tgacatttac ggacaggatc aaaggtgttg ctgtgaaccc tgtggacacg 

106261 aaggtacgga atggtacata tgacgactat cctggtaaga ttcagaccac attggccgaa 

106321 acacagaagg tgccggttgt ttaacgtctt gacagattac tacgaccgcg tccctcgtcc 

106381 attccagatc accgggtacg atgcctcagg cattgttgag gctaccggat cacaatgctc 

106441 gcttttcaat gttggcgatg aagtatacta ttcgggaagc cccatcagac agggatcgaa 

106501 cgccgagtat caattagtag acgagaggag tgttggccgc aagccgaaaa gtctcgattt 

106561 cgtggaatca gctgccatgc cgttgacgta tatcacggcg tacgaagctc ttgtcgaacg 

106621 actggagatt gagaaggaag aaaaagcagc ccttctcatc attaacggcg ccggcggagt 

106681 gggtgctatg gctacacaga tcgcacgtgt tgtgttggat ctccctgtgg tcattacgac 

106741 agcgtcacga ccggagacaa tccaatttac gaaagacatg ggtgcgactc acgtcgtcaa 

106801 tcacagagag gatgtcgcca gtcagatcga gaagctcaac ctcgaagtac cgttgaagta 

106861 agacgtgtct gtgagagtcg aaattaatcg ctaacaacaa ccaaggtata tctttattac 

106921 acactcgaca gaccagtaca tggacacgtg tgctcgggtt tgcgcgccgt tcggcaaggt 

106981 ctgctcgatc gtccaaggcc aagccaagat gtatggaacc gagttcatgt cgaagtcgtt 

107041 gacgttcgtc tggtgtctgc ttggaaccaa gccataccat aaggtggatg tagaatcgca 

107101 ccatcgcatt ctggaagagc ttacagatat gatcgatgcg ggcaagatca agtgccatct 

107161 cacgaagcga ctgaaattga cgcttgacgg cgtgaagaag ggacatgagc tattgcagtc 

107221 tggagggagt ataggcaaag tcgggctagg tgtagatgag ccaggtcctg gagagatctt 

107281 tacatagcaa tgtgacatgt agcctcactt ggcagttgaa tacacggaga tatgaactac 

107341 agcagtgcag gaatatgaat agccattatt gtttctttcg tcttcttcac ctcaccgcca 

107401 atgggatata ctaatgtgcg aagatcggcg cttcagatat tcgaggtcat ttttctttat 

107461 ccacaaacca ccgcgaccac cattttcttt cccaactacc tacatggagt cgagctacaa 

107521 gagagctcaa ggtactccgc agcatccagg attgacctcc ggtaggcaag gatgtaatga 

107581 acaggacatt atgatcctcg tggtatcaca gatgattccg ccataaggtc cttgatcagc 

107641 tgcatactaa caaaggcctc aacatgcaag ttaactcctt aactagtcat atcatacgct 

107701 aatttaccga ctccacaaac cctcgtcaca atctcggcgc catggtctcg acgaaaagct 

107761 taccctttgc tcgcaacacg tccgagactt gcgcctctcg ccacggaatc agagtaaccc 

107821 tgcaccaacc tccaccaata atcccaccaa acaatataca cagcccagac tacatgccac 

107881 gcgtacagag agagccaagc gaacagtgct atcaaacgtg gaagataatg agcagtggca 

107941 cagacactcg ttacgtctta tcagacgtgt ttccagtgca tcgccactga gaaaaagcca 

108001 gtagaccagt taaatatacc gttcagatgt aattcatatc tccagccata caagatgcag 

108061 caaggcgtgc tagtcgtgat aatcaatcac aacaaccaat gctctgggat ctccatcacg 

108121 ccaccgtcac cccattcttt tccaatagaa ggatccgcta ctcccacgtc aagcacagaa 

108181 gaactacaga gacacgtgta tttcgccgaa gataacggtg ataattggac ctttgagcgg 

108241 atttaatgaa caagtgaggg ccctcgatgt ggcaaagaca gtgctcttct atctagtgcc 

108301 cagggcctta cagaagggga ggtcatactc catgattatg gtatcgtggc aatgaatctg 

108361 agtcttgcag tctcaacaag gaccctgtag taaaactata caaagtgctg gatgggatta 

108421 aagtggctgt acgcatgtgt gtgaattcta aaaattgact tgagcaagga atcgaccaag 

108481 caggcaagta agatgagata accacgaatc acagtgacaa ggagttgatt gcgcactaca 

108541 gaagatagaa acaagtcaag tagccaaact aactatacat gacactatag taaaagctcg 

108601 aataaatcaa gctaataccg gtaggcaaag tggataccat catcacaatt gatcataacc 

108661 caccgggccc agttgcgtac ccgtggagag aaaacaagtt cagacccaca atcatagata 

108721 gcatcataaa actaatttct ctgcctaact agctaactcc aaccatttac tttttctagc 

108781 tagctagcta gctttctaga aagcaaacac tgaatagcag ccatttgacc cccccataac 

108841 atagtgattt accccacctt ctcatgctat gacaatcagc acaactatcc aactactagc 

108901 tagatagcca gctaaatggc tagtgaggtt acgccatcac ttgcgtaagc actcaatcca 

108961 gagcgagaga accaatcaag atgattacca ccacgtagcc actggtctac actgcacggt 

109021 aggagataat tgtaagatag aagatgggag tggaggccac taatcattag tagttccagt 

109081 cgatggagcg atgatgtcaa tgcacttaag ctagctagct agctaggcag gtagggcttc 

109141 aggcatcaaa gttctcgtca tcacggatga tatcagttcc gtgcggtcat agatcaggat 

109201 agacagctaa gtggaggatg acgacgacga caacgacgat gatggtgatg gttattgggt 

109261 atcaatcaga gagctttatc ataaacgtgg ttattgtggt aggtcatggg aatttttgga 

109321 acgacagttc taggaagcgc tctcgagtgc aagatctgga agtgaaaaag atgtagtaag 

109381 aatagaaaga atatcaaaat ccaaatccgg tcaaattgat tctgcctact ccgtcgctaa 

109441 aggacgaata aaataaatcc aaccaaattc agtccaggcc agctcaactc aatcaatatg 

109501 tttgtaactc tttatccgga tctatcgaac tcccgagtca tgtccgtgtc ttcgtatcgt 

109561 ttcgtcttgg cttttatcgt ttctttgtgt atatatcgtg tcgtagcgta gcgcatcatc 

109621 tacgtaattg acggtcacca ggaccgtgag gcttttaggc atagagtgga agaatgggga 

109681 ggagaggagg aaaagatgaa gtgtatcatg tcgaacctct tctagctgga acgaggatgg 

109741 tgctgagtcg cttccaacgc tgtgtaggtg aagtgggtgc atcgcgatct agacggttgt 



109801 ctagactggg cttcgaagag gattgagtgg aggatcgggg tcttgtgaca agcgcttcgt 

109861 ggtcggctag gacatcctcg agccggacat cgcaatcgga catgcgcggc cgagtatcgt 

109921 tggggtcgag aatctcgata aatttggacg acatgtcgcg catctttcga ttgataagtg 

109981 acttggtaga gggaagcttg tggtggttgt cgattctttt ttcttgttgg ctggaatgcg 

110041 gctttaacgt tttctttttc ttgtttcctt tgctacggcg gcgatcacga ggtggatcgg 

110101 ggaactttgt tcgtgaaata gtcggcggtg gaaaaagtaa acccaaccgg gtcggagaag 

110161 taggttcgtg atggaagtta agggaataac taggagcact gtcgaccgtg aagcgaagaa 

110221 gatccggata accgctggcg gatatatctt ccgtgcggac agagagtgga cagggagttg 

110281 gagatgagca gaggggtaaa cgactgggac atgaaggaca aaggaggaga aggacgagaa 

110341 tgaagatgaa aaaggcacag acagtgcagt agcgggtgat gccgattcgc tgagcacggt 

110401 ttcaagtatg cgccaggcag ggatgtcgag cggtcttctg cgtctgagcg gatgagctga 

110461 cagagaatgg ccctgaaaga aatactcgtt gttggagaga aatctaggcg ccgttgacgt 

110521 ccggggaaga tgggtagtat ttattattgc agacgaagaa ttttactttt gagttcctct 

110581 atgatgaggg tgtacccatt gacccgccca gtcaaagtgc cgcgagttcc gcatcacaga 

110641 ggaaaaggaa aaaaagccaa attatttgcc agaacacgcg ggggttgggg gatgtgggaa 

110701 gggatcaact gagggactgg gacggtggcg catccgccag ggtacgagtg agcaagggca 

110761 tcagcggcca tatacgccgt gtgcccgggt agagcatgtt gtgatgggct accggaccct 

110821 ctgattgacg gggctggcgg gggtccatcc tcaggaagga gggaggagtt ccgacgatgg 

110881 ttcgtccgtc ataccgaagg gtgtgagtgt gattgccggt gctgatgaag cggatgatgc 

110941 ggtggatgcg tggcgttggg ttgggtttct ggtcgtagct cgcccgatca gtcgtcaatc 

111001 gtcaatagtc attgcgatca tgtctgtaag cgcgttcgga tggagctcct gttcccttaa 

111061 ttcctgatcc agtaagacgc gatatcgact cgactcgact ccactgccct gcggaccctc 

111121 ccgcccgtcc cggagagtga gactgagagt gagagcatga cgtcgcgccg aatgcgatcc 

111181 atgggtcgtt ttctgcagag ctatccagtg attctagacg taagcaagag atactggttg 

111241 tcctcaacca ctctatccat tgttgtggtg gggttgggag tcatcgcaaa gggggctgag 

111301 gtgtccagct ccatcgttgc agaggctgtc catacaaatc gcggggatga cggcagatcg 

111361 gcttatcaaa tggatacggc cctcagacga tccatgtgca acgcaacgag tccatcgatc 

111421 agcggagtta ttaatagact gataatggag aggagtggaa tgcgtattgg atgagtctcg 

111481 cggaggaatt gaaaatgagc cacgaaaaac tccacgagaa agttactcca gagtcacggg 

111541 atactttccc ccaaagcatg cgctcgccgt gacgtcatgg catcgttgct tcgcctgtgc 

111601 ttcgcctcca ttcttcgcca gtcgcaacga ttactgtctt actgttggct gtctgtcgct 

111661 ggcacaatgc cacttcatca tcgcgatgcc tactaaacgt gcaaggaaaa atccatcggt 

111721 ggcacagaaa ccgccttccc ctccgctgtc cgagaagaat gccgtcatca cgcaagatat 

111781 cccagaggaa gagggagcaa agcgagaatc tacccttcga gatgagtcgt acacacctga 

111841 gctattcgac gaagacgagg agatgcccga agaaacggag aatactatcg accccgacac 

111901 tgggaaagca gccaaaagaa cggcggaata cgcactagga gggggagggt acaacgcact 

111961 caagtcgtac aaaggacaga tgtactccgg aatggctgtt ggaggatcgc acacctggac 

112021 atacgaccaa gggacgtgga aggaaaccaa ggaagaaccc gatctctgga gaatcgacta 

112081 ccagacgaac aaaagacgag cgaggaaagc gcctacaggg agtggtgcgc cagtgggaac 

112141 cgagtatcac tggctgattg taggccatca ggtaggctac gccacttcat cagacgacgg 

112201 ctggctcacc cccacagcat gtgaagaaaa tcgatgccaa tacgtacgag acgcacctca 

112261 caggctcgaa gtataaactg gcgtacaagt ccgcgtcatc gaacgcttgg tcgattccga 

112321 cagtcaagaa gcaacgggag cgggaagtcg aattgctgga tgacgccaaa caacgcgtcc 

112381 agggactgcc gcctgtgctg gcatcagaga aggtcaaggt tgagaaacgc gaaaagggcc 

112441 agcaaagact cgacagcatg tttggaaagg ctgcaggcgt aaaacgaaag gcggatgaaa 

112501 atgcctgata cgcgttgctt ccgacaaatg ctacgtgact aggccatgtc acgtctgctt 

112561 cggcatacaa tctacggcaa tcattcggat ttacgggtcg aaaaatgaga catgataatc 

112621 aatccatgaa gatattatgg acataagcta aaactggctc gggcgtattc cgggccatat 

112681 tttgcaaaag tttttactgt gaggctattc agcagggtca aagcaacaat ctagatcctg 

112741 catgataaaa ctgcaagacc gacgctatgt agcatagcaa cggagtcgag aacgtaagac 

112801 aattggatag ggcatgccca ctccgtgtac tctattaccc gaaagcataa accaacccaa 

112861 gtatagagcc agatctagac tcacccgcga acaaacaaat ctcaaccctc atggcacgat 

112921 ctagaacacc aagaagaagt caatctccga cccacaatct catcctattc cttggaaccg 

112981 tgacatcatc caccccaaac ccaatataag cttaactttg gcctcattcc agacctcccc 

113041 tagaccatct acccgccggt ctaccttccc cgttcaactc aacttcacat ccgccaccca 

113101 accacccgca tttccaccgc atcatgtcca aacgcatcag caccttcttc caacctcttc 

113161 cgtctaagaa actcaaaccg gatgcaacta cgacagcaca atcatcatca tcatcaacaa 

113221 caacagtccc accaaaaccg tccacctaca cccatcaccc aagttatccc atcccaatcc 

113281 gcgacccgcc cagcaccatc gccacgcctc tcctctccaa tccagcacct cgacctccca 

113341 aagccttaac caaccacccg cacctggatc tactctactt tcaacccttt ctcccctccc 

113401 cactggcccg agacctattc cagttcctgc gaaacgaact ccccttctac cgcgttcgat 

113461 ataccatccg gcgcggtgga acagaaacat ccatcaacac gccaagatac acgactgttt 



113521 ttggggttga cgatacatcg acatttgtta ttgaaaactc cgaatctact agttccaatt 

113581 ctgagaccag tgtccaagac gacactacaa ataatctact agatcctcct ccagccaact 

113641 caccaatatt ggtcgacagt aaaacacgca ctcctctcac cgaatccaaa tccaaatcca 

113701 aataccgatg caggcctcga cccatcccgc cgtgtctgga cattctgcgc caagcagtgg 

113761 agaaggccac ggacgatggg accagatata atttcgtcct ggtgaattac tacgccacag 

113821 gggacgatag tatatcgtac cattcggatg atgagcggtt cttgggccag aatcccacca 

113881 ttgcgtcctt atcgcttggc gcggggagag atttcctgtt gaagcataaa ccaggtgctg 

113941 ctggtggtac tactgatact gatcgtgcga ttgcaaatgc agctgcgaaa ccgctcaagt 

114001 ttcccctcaa gtccggcgat atgctaataa tgaggggcga gactcagtca aattggttgc 

114061 acagtgtgcc taagcgcaag gggttgcagg gaagtgctgg tgctttgggc cggatcaata 

114121 tcacgtttcg gagagcggtg gtgcctggtg gtacggagaa ttactatcga tataatgtgg 

114181 gcgatggaga ggtgtatcgg tggagagatg aggaggggaa aatggttgct actactgaat 

114241 agtttatgat tatgttgtag cttcatgcat atatagatac acccggattg gagggatggg 

114301 ggaagggaga cactcagtgc gaagcaaacg ggttgctgta agctcggaag cgatcaagtg 

114361 gctgcgcact gctgaggatc ttgtgcaaca acatcccgta gtcccatccg atagctgcgg 

114421 ccgcaagagc agttagactg gcctggtcct cgcgaccagg tcgtccaggc ccggtcatat 

114481 tctattgttg tcagcgacgg ctcagcaatg gacagggaag gacatacagg cttcatgttg 

114541 atatcgaaga tggcaaactg tgacccctcg ctaaagcgtc taatgtcaat gcgaattggt 

114601 gccgtcgcgc caattaattt ggcgacttgc tcgcactgcc gcattgcgct tctgtacgac 

114661 tcatcctgca tttcgctttc tgtgacagcg cgcgagttcg cagtcactgc cactgtacca 

114721 ttgtatggcg caatgccatc gacatggtta aaacgcgtga tcggaggaag gctccagtat 

114781 tctgggccct ctctcgatgg tggcataaca gtgatcgttg cctcttcgcc cgtgaggaac 

114841 tcttccacca tgacaatggg cgactcaagc agcagagcgc tgatatgttc ctcgagctct 

114901 gctcgatcat gacatacctt tacaccgtgg ctgccgcgac cacggacggg tttgccaacc 

114961 acgggataac agggaatgga gtctagtata ggggaaatgt tatccgaatt gctgatcagc 

115021 catgagcggg ggagcgtaaa tgagccatgg atacggagct gatcattgag ataggctttg 

115081 tcatcgaaag attcgactag ggttggtggt tgcccgacaa cagagatagg gatcgaagca 

115141 agagaccgtg aggtttgcag tgggtgtgag gagaaaagga tcgtgttggc ccaaaggtgc 

115201 gttgcccctt ggcgcacggc agagaggatc ccgttttctg tatccgggaa gcaccagcct 

115261 tcttgctctt tgggacttgg ggaagagacc ggtgtcacga cttgtacgcc acggtttcta 

115321 agggtgtagg cgatgtcgga tcccgagtct tggtatccta gcatttatgt taggatgagt 

115381 ctttgggata gcaggagggg atgtgatacc gccaggcttt cggggcttcc gagcgccgtt 

115441 gatgatgggc ggctcgatgg cctggtagag gactgcgact accggagggg cagtcgcatt 

115501 atgcaaagac ctgcttatct gtgtcatggg tagactcatc gtgttgttta ggtggttagt 

115561 tgcctcgaag ctggaaatga cgaagtgtgt tttatagatg tggagcaacc cgataaaagt 

115621 gccggcatcg gacaaatcca gctgatcctc ggatagatcc gaaaacacgg agtaacccgc 

115681 ttatgccgat tgcactcctg atagcttcat tgtgttatgg aatcaagcga cgattgactg 

115741 ttcaggtgcg tagaaagatc tggcgggagt tcttatgatg attagcgatg catcgtaagt 

115801 agaagttgga cgacaaatgt ttcacttacc ggacccttga aagtcgcatg ccctgtggca 

115861 aattggcaaa gtgggtgatg aatcataaag tagaaagcgt agtcgttcct tccagtatca 

115921 gaaacctcaa accgatgctt tgatagttga ggttttggac attttatcca gctttgtcgg 

115981 atgctgtctg ctggtgtacg ttcatcgttc tgaaggagtg actgatgtga tgtctgaaat 

116041 gaagagcatg gattcctcgt cgataagagt tccgtaatca cagagaattg gtcttcctgt 

116101 aaacaaccat gctagcttaa atcttgacag tgaaagcggg atatcgtact tcaaaggtct 

116161 ggcgacagct gtctggcaat gccgagtggt tctcaatgct gcctcgacct gaaaaggact 

116221 cgaaaccaaa cattgagaca gagaaggact tgtgatatgc ccaaagaagc actgctacag 

116281 aaagaaacgg acggtttgga acagtatcag agcgatttgg tgtgattgga gtctgctggg 

116341 ggtgggcaga gtgggcctcc ataacgagca tgaacgcact agacaaacca agcaaacgcc 

116401 ggtgactgga tccatactct caggagtaca ccccatcctt caagcatcga caccaattca 

116461 tggcatatca gtctctctcg cagctctaac tacgctaaga cataagcaga gtggcagaga 

116521 tatccatttc ttcatggccg cgaagggaat tgtctgtatc ggaacttgct agggagcatt 

116581 ttgaccttgt caagcggaga atccagtcac tccctgagac taatatagcc atctgaacgt 

116641 gagatcgtct taccacaggg tttgttgcaa atttgtgtac tcattatgga ttggaaacta 

116701 caagactaat gtttcgttta gcatagatct tttcgaaagc tgcctaacga caatcaccat 

116761 tcagcttcct tgatcaaatc acatacccag aaactataag gtgatggaac acttgttgca 

116821 tgtttcgagg ccatttctgg cttgactcaa attatgccag catcatgttt tgcacggtat 

116881 agtaacaaac cgtcaagaat gtgactctca gctgggatac tcggttacct cggagctagc 

116941 gatattatcg aaagtatatc aggcaccaga aattctggga tgaggctaaa cacagatcta 

117001 tttggagcaa gcatgatgac atgagcaatg acagcgtgtt tatgacgtat cacatcgctg 

117061 taatggcttc agttgctccc accgtatgtg tctgactcga tttactgtgt ataggacgat 

117121 cgatcctgat ggggctcagt cggggttaag tttccttcgc gctgactata ccgtttgaat 

117181 caagtcgact accagccaag agaaaccatt gtctaggccc tttgatcgtc agacacccat 



117241 gaagacggac ttcgcttctg cggaaatccg ggtagggata taacaatccc ggtctttgaa 

117301 aaaaaattta ttctgacata tcacatccgc cggaccgacc gagacattgt caatggatgc 

117361 agccatgaca cggtgctttt gtatccgcat gtcttgagga tgaagcgcct tcggatatgt 

117421 ccgaatgtat gacggtaaat gattgataca tcacctcacg gatttgccga tggcagacat 

117481 ggctaaccta agatggccgc actttctcca ggcagaagaa aaaatgcgct gggagctatc 

117541 ctgggctaac ctctcaacgc ttactctaga taatgccgat ctctgaactg ttaaggctga 

117601 tggaacacga aggccgagtg cattagccca cagacgatcc gtaatctcga caacctgtaa 

117661 aaaatctgac acggttgctc aaaaagacac aacggaaaca aagatttggg agcctcgagt 

117721 ctcactcagc ctaatgctgc tgactggcgc ttcaaacagt agtagcgggg ttgttggctc 

117781 agctggaaaa agtcagcctc cttgccccta agatttccaa ggtataccgt cagccagaca 

117841 acggtgttcc tggataatcc cggagaaagc ctccaaatca gtttaaaacg tccatttttg 

117901 gtacttctta tacacggcta tggaacaccc gatctcactt tgtcagcgtt gcagggttct 

117961 ccagttcaac gagcaagact atggcaccgt cgaaaacgag tcattgggga actgcactgt 

118021 tgtgtttgac cattttgatg atgaagatta ttatgaggac gatccattct atgaagatga 

118081 ccaggatatc agaaggcagg aaatttcgct tgactaccag ctccttgaca agtttccaga 

118141 tttttggttc ttacggcatc cactgcatct ggatgtggta tgtgtacttc catgcatgag 

118201 aagatactat gactgagggc gacttatctt tcatcggatg gcgatgcatg gtggaaaaat 

118261 atccctgtcg gtctttctcc acccaacacg acagattgtc agcattgtct gcaatggctc 

118321 aaaagcttgc attagagaca aagggaacac agttagcaga aatttggata gaggaatttt 

118381 atatcaacct tctctggcca ccccaaggca tgagatcatt ggaagccttc ggaccccgct 

118441 tgatgatgaa gagtcaatag caccatcatg gagctgggtt agtaacccaa acccggtgaa 

118501 atggggtgga gaaaccaata atttcgatcc tctgttcatg tgccaatcta agattgcatc 

118561 gataaatatg aagactaact tgaatccata tggacaagtg caaaagggtt ctaacatctc 

118621 tcactgttca ctaagtttca ccctccaatt ccttcgacca tagtacgcag tggagggttt 

118681 cgtcttggtc atgtcgagtt cccctgtgaa ctgtgttcag ctgattgggg gagtacgttg 

118741 ccgatgttcg cttcgactgg acccatgact catccgaggc tcagacacac tccacagaaa 

118801 aagagaaaga catcctgaat atgttgtcta tggtcctgat attgagatat gatggtgaag 

118861 aggaagactt catgatgggt ttgcttgtgc tgcctacgaa tatgcctagt gtatatcgca 

118921 gatcaggtct cttttcttgc gaaaatggtc aagaattctg gtataacatt gatactccaa 

118981 ttgtcgcatt ggtttgattc tttgagttgt aattttaagt tttctttttt tagctcggta 

119041 ttattttgtt tggtttcttt gccccacaag aacagtgaaa tggagtcgag tggaacgcct 

119101 agatctgttt atcatgggta ctaaaggagg tgagtaatgt ccattagcca ttgagaaggc 

119161 cggaataggg agattgagta tgtactggcg accgacacta gatacaagaa gaagagaaac 

119221 aaagccatcg agcctaacgc aacagagcaa ccgaacagta aacaaagaga actataacct 

119281 cggtctccat ccactctcct tgacttccct gcgactctct tctctcaaca ctcgcttgag 

119341 aatcttgcca cttgccgttt tgggtatgct cggtacaaat ttgactcccc cggtgagcgc 

119401 tttatatccc gccagcctct ctagtaacca gctcttgacc atatcctctg tcaacttgct 

119461 ctcatcggtc tcaggtcggc ggaccacgta cgctcgcggc agctccgtcc ccgggaccac 

119521 atcccgcaac ccaatcacgg cggcatccac aatgagcgga tgcgccaaca acacggcttc 

119581 cagttccggt ggagcaactt gaaatcctcg gaccttgatg agctccttct tgcgatcaac 

119641 tatgtaccat ttctgtgttg cacaatcgca atacgcgatg tccccggtgt ggaaccagcc 

119701 ctgctcatcg aaagattccg cattggcgac cggattgttg aagtacccag gtgtgacggc 

119761 aggaccgcgg acacacagct cgcctcgcac tccatagtcg gagatatcgg tgccgttgtc 

119821 gtctactaat ctacatcatt atgttagcct cgacctgatg gaagaaagag agaagaacat 

119881 acttagcctc catgttcggc accaacctcc ccaccgaccc tgtctcatcc ccttcatccc 

119941 accggaacat tgtcgcgata caacatgtct ctgtcatacc ccaaacctgc gtacaattgg 

120001 cccctgggcc gagtagcctt tggaaccgtt tttgggcatc tttgtccatt ggtgccgcgc 

120061 caagattact cgcccggatc gaacgcagat agccccgttc gtaagacatc ggatgcatga 

120121 tgatggcaat ggcaatgggc ggaacaatgg ccatctcggt aacttggtat ttttcaaccg 

120181 tctgcaaaaa gaggggtagg tcgaagcgcc gcatgacgta gttgatatgg ccggctttta 

120241 ggctgccgaa atggctgctg ggagcagcgg cagcgtggaa gagaggaacg gcgatgactc 

120301 tgcttacctt aacacattga attgtcagca cttacgacgt ggaggttatg agatatggct 

120361 ttttgagctg aaggggaagg tacacaccct ccatggcctc ggattaaact cgaacacgaa 

120421 ctcgtgctgg gcgacaagat tcatatgcgt gatctcagcc gccttgggaa gccccgtcgt 

120481 cccactacta aacagcctgg ctgcagtagt tgtcctggaa agattctcgt catcaaaccg 

120541 tacccagtct tgttcaccat gttcgagcag ctccgtccag gacaccgctc catctggtac 

120601 tgactgccca agaacatcga atacccgaat ccgatgctct gggattgatg ccgctttcgc 

120661 cgccgtagtt attgcatcta aatgttctgg ctctgtgatg acgaagttcg tagccgaaga 

120721 gcggaaatga tggcctaatt caaaagctgt gtaggctgga ttcgatcctg taaaaatccc 

120781 tcctgcgcca acgatcgcca ggaccagcat ggtatagtag atctgtcatc tgtgagtaat 

120841 caactcgcat tggagatgga gttaaatgaa agccgtacat cattgaaaga gtgaatggcg 

120901 acacagtcgc ctttttgcac gccccaggcg cgcagcccag agaccagttg acgaacaatg 



120961 gtccgggctt ggttgtagga aatggagcga gagggattgt ggacatcgac gtagatctat 

121021 accggttagc ggtgttctgt aggcgagtag gaaggagttt gagaagggga aaatgtatcc 

121081 cccatcggaa tattaccggt tcatcatgat cgtatggcgg gtcagaaaag acataggaga 

121141 tcacatcggt gcagggcacc ggggccctgg ctgcaggttc aaaaatcatg ctaagtggaa 

121201 aatagtgaag gaccaggggg gatgcgagga tgcaaggatg cagggaccag gaagacacat 

121261 gatgaccgat ttataacttc agctgaatga atccagcccc gatgctgaaa tagggtttat 

121321 cgttttgtct ggaaggccag cacggcacgt cacccggagc gggcaatggc actgactcga 

121381 gtgctggaca catggctggc atggtgtgac tgggaccccc gcctgcgaca aaggcattgt 

121441 tgttcaacgg ctttacggga gaccattcaa ggaacgccag acctgcaggc cggagaatat 

121501 tgaggaatta gcgtgcactg gcctggccgg gggtgctcgg gttttagtgt tgcgcgtcgt 

121561 tgggaaaggg tcagagagac caatggctgc gaggcgagct gataaaccac ccccagcagg 

121621 caacccaccg acccgccacg ctgcatttgg cagctgcctt caagccaata attaggctct 

121681 cctgcgcccg agaagctcgc caacggccgt ggcgtggcag tttgggtcct cggcccatgc 

121741 ttatcaccga tcacagccga acctgcaacc tcggatcccc cttcgttctc cgcaccgcta 

121801 caatggtccc gagcagaaga acattgtcaa ggcgatcaat ctcccgccaa tttgactcac 

121861 cccagtacga gactctcata ctttgtcggc cgactcagtg gcagcacaag acctgatagc 

121921 gaccgcagca ctgtgctgat tgttctcacc gaagtcgaat cttccagatc cgctcttatc 

121981 cggtctgcaa tgaccgcgat ggaacgaacc tgagtgtcgc agaccatgcg cttatcagca 

122041 attcgatcag cctaatacat gatgggctag ccgttgttgt cctcgagcgt tatggccaga 

122101 ttatgacaaa cagccaacct gaaggattag ttagcggtta gttgatgacg gtgaatatcg 

122161 ccatgacggg gataacagag gaagcaccag ggattacagg gaccaggttg atttgttcaa 

122221 acatgcctac ctcttgaatg gtcatcttca ttgctgaagg aggttctcgc ctcccctttc 

122281 caaacaatcc acgcgcatct tgagaaagca ttcgtggtct tcacaacttc ctaatcgatc 

122341 tgatgggctg ctgtatctca gacgagatgg atcattgaac ggaatctggc catactgaaa 

122401 ccgccgctac cgccatttag cgaccctggg gcccatccga gacataatga gatgtgtcat 

122461 gtgcctgcat agaaatgtac aacgattcgc cgccattcga cgggttctgt ttcagctcca 

122521 aatttcaacc ccacaaagga cataacctgg attatatatc aatatgatta gcaaggcaat 

122581 cccagagcga accaaggcga cgtaataata cgttaagagc gttcaacgca aaccttttct 

122641 caccaaaaga aagcctcgga cggagctgcc tacaggatct tggattacct cacgtaccca 

122701 agcagccaca atgaccattc acttcttgaa cggcaagata attttgacgg cgcctgattc 

122761 gagaatgaat ctaaaaaaca gccgcgacag tcatgacgac gtggacagtc tctgcgcaat 

122821 gtcacaggtt ttttgaagaa aagttttaga tcactacaga cctagaagct cgtgacccga 

122881 ccgtgtgtac atgccagtag cacacaggtg gccgcgctgg ataaaggttg gatcgggtgt 

122941 ggtgtagcac ccgtggtgta gatatcgacc atggagtccc ctgggtctcg ggcataatct 

123001 tgacggtgcc atagaagtgc tgagttagct gacccagata gcatttgaca ggtatttgtg 

123061 acccacctaa gttgtagatt ctacagtacc gatatttcct cgctggagga gaacaagttc 

123121 gaggatattc tttgaaatgg caaggccgca ccaacctaca gcaaagccat ctggcgctct 

123181 tcataatgcg gctgataggc tcgtaccttt ctgtacctgc ctcctctttg cctacaatct 

123241 gacgtgagtt gtgctcaaat tcagagtcat cttaggctgc gacggcaaca ttaagtctac 

123301 cctagggctt cgtgccagaa tgcaatagcc ggggccggaa gcctctgtga cgtgggagcg 

123361 tcctttatga tagaatgggc aaacatagcc gagtaccaca tcggctttgg taagcaatga 

123421 atggccatgt agctgagatc tttgtctcgg gtggtggttg cttttcctgg ctctcaatac 

123481 gcggctgggg ttgtggttac acgaaagttt atccaatcca tggaggaaat tctgcatcgc 

123541 ctcatatatc acgcggaaga ggcgtcccct tggcaggcgt atctgcatct cattaaccgg 

123601 cttggctggt ctcatttggc acatagcctg gtgacaatga tggcttcaat ctagaccatg 

123661 ttcggatccg gagtttctgt tgctcgataa ttcaactcac ttttccttta ttccgatcga 

123721 tgagcatatt tgctctgttt ttcttccatt gccccgaaca tcaaggtcac aaagatattg 

123781 acgtatccct tacggtctaa tccgagtcat tcgtgaacgt gcagagtaga gggccttgca 

123841 gcagggagca tgagtggatt tttgcggcag tcccgataag caatattcag ggtgtacgta 

123901 gtgtcttgca gggatccccg cagcacagag aaaatgaaat gcccatgtct gaggttccga 

123961 atcccaaggt cctctggctg aaccctgctc gacgcagggt cttccacacg ttcaacctta 

124021 agtgtcagag tgctatttgt tgtcgagttt tgctcgtatg acatgataat acgtttgagc 

124081 caggtgagca gtataccagc tttacaaaga ggacgtatgt agcgaggtag cacaggccag 

124141 cagaatgcca agagtcttct gcgtgggtta gcgttgtaca tgcttggagt tgcaaaattc 

124201 gccggggttc ttccgaatga cgatctccaa accgtcaaac agaaagctat gcctctcctg 

124261 acggtcgcga tccagcaaaa aagctagact ggcatcgaag gagataagtt ccaaatcttt 

124321 agctctggag tgttcttatt cgtagtcagc cgaatgagag actctagcgg caatatcgat 

124381 gttctcgccg tcagaaagcc gcctcgcaat tgaaatccag acctaaaatg tcttcacggg 

124441 cagcacgagg ggcgctgaat aacatatgcc taatcgccca tgtatttagc gcatgtctgg 

124501 gatgtttgcc aaaacgccac caacttcttg ggtcgggatg actggtcaac gctgctgtgg 

124561 tcatctttta gccggtgggt ctggattagg gggaaagacg gtgcggtaat ttgcagcaag 

124621 gttgacgatt ttgacagaaa aggtaatgcc aactcgattt agacggctgt ctggatgaga 



124681 ggatcgcctg taagccggat gttagtctcg ctccgttctt tcatttcgta gccacccggc 

124741 gggtaatccg tgcagatggc tcagcaaggg ctggatactt ctgctattcc agtcgtgtca 

124801 ggttggtggg aacacgaaaa ccagcactat aagctctggc aaaaggcagg gtatgttcgc 

124861 atagttatat gatcttcact taggtaagat acgactcggg cgctcagacg cccgttgtgg 

124921 aagtgaagtt ggtcagctcc ttaatatgga gacaggtatt ttacttcgat gggcaacaag 

124981 agccgtaatg tacctctagc actattccct ctgtcatact cgaagtcaga ttcttaaaag 

125041 acggagtata gtatggctag cctgttgggt ggctcccggg ccggtgaggg ctcaaagggg 

125101 aagcgtagga ggaacgtcac taattggaac cagtcaacac tatccctcgc ttgtccagaa 

125161 caagtaaatc aagcggtacg attgctccga gatccggtta cggagacttt gtattgctgt 

125221 cttctgaggt atgttcctag agaggggaaa acttacagta gcgtgataac caggaaatgg 

125281 catgtgcatg tgttaacatg atcaggttga tagcgatggt tggttatttc cctcttgtga 

125341 gagtatctgt gttgcagaaa gacagcagac gagtgctttg tgtgttcgcg tgaaccttct 

125401 tctgggcatg gctgctccac tccccttcac aacactcgga taagccgagc atcgaacacc 

125461 tgcagagatc aacctctgtg gcgtctcatg gtaaggcggc ggcctcatct gcttttctag 

125521 tcaaatcgac actcgctttg tagaggctta aggttgcctt gctgtagcca cgcgaaacac 

125581 cctcaatgta tctgtttatg taattcagaa ggctgttaaa ccagggggta atagaaaact 

125641 accctaggcc attaatcttg acggcctaac agatggaatt ggacgcagat tggaggaatt 

125701 atggtctccg atagatcaga agtgatcatc ttaactgata acattctgct cggcggataa 

125761 gtgcatgcgg cactacaact gtgctactgc atgacacgca tggctctggc tccgcacgca 

125821 gttcaagtta ttttaaatta ttgttttcct tcgacataga tcctctgtat aagagcattt 

125881 ttggcttgtt gcacgggggt ttggaattag aatacgggag atcaagtcat aaggctgaac 

125941 ggttggtggc atgggcgccc tataacacac aagtggaaga tcagcacagt tcgagaagtc 

126001 ggaaacggga cgatcgacag aggtggtagg tatagtactg gaaagcatca caaagagatc 

126061 taaatctaga acagaagatt ggctaattcc agggccactg aaatcataat ggtttcgacg 

126121 agcttcgatc acttccatcc tgggtgtctc acatgcaagt cgagagtgaa ggcttgcagg 

126181 tgtggcagcc tcttcagcaa gttgcagtag gttgaggcgt ttggctggtt tgcagcagcc 

126241 aatcagcgtg atcagatgac ttctttatag agagttgtat ttggaggtca cttggtagtg 

126301 actcttagtg acttccgagg cgctttccaa cttccaattt cgcactacta catcacaaca 

126361 tcctcttaca ttgcgcaaga gccaatccca ccccctgcac ttccacttct cctacatcca 

126421 gttcttctag cgctaccttc acttccgaat atggtagcat tgtcaactct atacaataca 

126481 cagtatctcg ggcactccac tcattatcat tccctgtttt ctccctctca ttcccgacct 

126541 gctaccacat ggcgtgtcag cttatcctac ggaagctcgt attaccctcc gactattgag 

126601 tctagctgtg gcccctagaa tcagaatccg ttggcaatcc actaaccaga tcgagtcgga 

126661 agatcgacgt cgcgtgagtg agcgtcatat cctatttttc actgccgggt ttccgagctt 

126721 cttagctgac ggtgtacatg actcgtggtt atattctcct gaggctcccg gtacgaaaac 

126781 cgggagttgg ccgccgctaa cccaagtcac ggaggttgta gtttaacgcg tggagactcc 

126841 gccggggtcc cagaatggac gggtgttggc gcctttggct ggcggcctgt ggcctatgct 

126901 ggcctatgcg tctcggggct cagcggagtg cgtcgcattt ccccatctcc atccccttat 

126961 cggctggcaa ttcccccctt tcctttatct tccacgagac cccagtccta gaatatatct 

127021 gcagaaaagc ccgccaacaa gaagactagt acatattcgc accaagtccg atgcaactgt 

127081 ccctgctatt gcgacggggg aaagatagca gttgtctagc cactccacgg ataacatcaa 

127141 agaccccgga agaatacgtg atcaaatcag gaaattggct aattcgtatt caatggttca 

127201 ggatctacct cgtatctgtg gatgacgctg gattcgggtc agacctagac ggaagcatcg 

127261 cgcaatatcc gcaccgtcga ctcagaatct atttccagga ggcccggtcc actgtggcac 

127321 aagcgatact cgcgtgaatt aaccggcccc aggaaccacg tttccttcag gaattgatgg 

127381 accgcggagt ctggattcat aattgcagaa tgtctgagtg ggcgggtggg ggggaagcgt 

127441 ttcctacggc tagcgtggac atcctgcgac agcgctaact tgcgggccat tccgctgcca 

127501 acgtgctaga cacctccccc ctcccccccc cctcttgttg tgcaaacact gtgcatagaa 

127561 gaggcctcct gatacgccga gctgaccaac gacaggtgtg aaacgatggt cggtctaaca 

127621 gatgcctaga gaattagggt gtggcgaggt atacgggatg ggggacatgc caaccgtcag 

127681 acagccagat gggatccggg tgggagcgct aagtccgtta ataagcggaa tggcaatcag 

127741 ggttggtggc cgtggaatta tgcccactgg caaaaggacg tgtggctttc tgcggttaca 

127801 tccgaataat gttgaaaaaa aaaaataaaa ataaaaataa aaataaaaat aaaaaaataa 

127861 atccgctgca atgtgatgat caatccgcac tagctgacga ccattgcggt tccatgactg 

127921 ggcggattat attattttat ttatcttgcc taacgcggat ggcttggact tggtcgatcg 

127981 ccgcctggcg tggcgttgcg ttctggattc tcaaacggga gatcgaatcc aagacattca 

128041 tcctatcgag ggactatgga gccgagaagg aggcacgcgc tccgtggttg gtggagcagc 

128101 ctccgcgggc tgaaaaaata tcagggatgc agggcaactt cagattaagc catgcaggag 

128161 ctcatacagc cccccatttg tcgagcaatt atctcagcag gaactagagt taccttcgtt 

128221 aggccgggtg aatctgatta aatccatccc atgctaatat tacgcagact aagctacaag 

128281 ctacggtttc gggggggttc cgcagtctgc aggccgaacg ggggatattc acccccctcg 

128341 agtcgggtca gatgtaccaa aacaaaagga tcaggatcct tccatttgct tcccgatcgg 



128401 tcgatagggc agcttgagcg agacttgact aatctggcct attgcttccc ctcaacattc 

128461 gccccggatc cgagatgggt ggcccggagg ttgtacttat cgacggcgtc ccctcggtgc 

128521 ctttccctct tctctgatcc tactattcat actgcctcaa tacagctata tcttatacag 

128581 cttcctccca aacgccactc atttccattc tgcttcttct ctgattcatc ctcgaacaac 

128641 tgagctgctt gctttcactg agtgcggcta gtcaagctat gcccgctatg accattacac 

128701 tgagtgacca gacacatact gtgtctgaga cctcatcgac gaacaatggc atctcaggga 

128761 cagaatccaa gaagtcaggc actcggaatg gccatcccca gagcaactgc caccagctgc 

128821 cgcctccccc atcgcaaccc actgcggagg gggaataccg gtgcggaatc tgcaacaaga 

128881 catacagtcg acgtaggcta ctcttttctt tatggttgca cagtgttgac acattttctc 

128941 catgactact ggctcccttg atcattctca gagagacggc cacttaatgc acacatgtgt 

129001 ggttaacaag ataatagttg atctgcggga ccggcaccgt cgccgttgta ttaagaacat 

129061 tggccaggaa cgccagtcca agcgcaaatc ctgcgacgcc tgtgcgcaga agaagctgcg 

129121 gtgttccatg acccggccct cctgtagccg atgtatccaa agccgacggc cttgcgtgta 

129181 tccgcagtct tctattcctg ttcaagctcc caatcttgat gatgctcaag ataatattgt 

129241 tacgtcttcc ggctcccttc actcagtttc ggtctgcgcc ggtataatac ctggtggcac 

129301 gacatgggcc cttcccacat atccgttcga caccccgtct actgccgacg aagtaaatga 

129361 tgtttcgggc tcctcctgga gtcccgaaac tccaacaaat gccgaacttg cgttccaaac 

129421 cgttaacgag agtactatgt ttcccatgca agatgtctcg ttgatgaact ccccgccctg 

129481 gcaagaggat ctccatgaac ggcccgaaac ccttgggctg gaggagttct tttccgggtc 

129541 gcttgacagt tgttcgccag ctatgtatat accaccgacg atggggtcct ctccagtgac 

129601 catggtacct cctaccccgg ctactctccc gggcgactct tatgctacgg gaggcggact 

129661 gagtatgtca tcggtatccc ctggctattt cgataatttc tggcccgata acatggcgtc 

129721 caccgatgaa gaccacgaaa cttggcgatt cggcacgtac acagatagcc ggtccggatc 

129781 ggggttcggc caaaacttcc tgcccaaaac aagtcatcat agcccgggtg acatcggtga 

129841 tctctatcaa gagctgttca gcttgttacg tgagtacccc gggctggcac tgcagagaca 

129901 gttctactcg cctttcctac accacgagct acaagggtat gcgatgcaat cgttgggcca 

129961 gccaattagc actactctga gttcgatatc atcctatgct agttatctgg aggcttgtga 

130021 tacatttgat ttgtcaatgc acgatgggga gcggctcgca gcagcagaag gctgcgtggt 

130081 agcgcttcat gctgtctgtg ttcaacagat cctgagcatt ttcggcgata acttcgccac 

130141 caccggcttc agcaagctcc caaactttgc aggcgatcaa gggcataccg agactcctgt 

130201 tgacattctc ttgagggtat gtttcccccc actccgattg ccctacgccc caccctatga 

130261 tttgagctga cctgagtggc cctagatgac gcagcgggtc tacaaacttc atgaggacgt 

130321 tcttcgcacc ccccatgaag atgaaacaga ttggcgccga tggaaattcg ccgaatccct 

130381 gcgacgtaat attttcttcg ccaatataat ccgaacatta gcaggacggg cccgccgtta 

130441 caacgggatc aacattgacc cgctagatag cgcgatcctt cttcagctgc cgttgccggc 

130501 gccggaggaa atgtggcgag cacgcagcga gggggagtgg atgatagccc gcgcacagac 

130561 gcagcggtcg tgggcgcggg gcgtggatgg cagcatgctg ccagccctgc ggaccctaca 

130621 acagctgctg gtgctggaag aggttggcag tgtaagtgtt acgttattgc ttcccatcac 

130681 acgcatgatt ctggcgtgtg ccaagctcaa tggagcttca tatggctgag gtggaccggt 

130741 gggccccgcg atagggcgat tatgatcgtc acggatgaat ttctttttgc atgattttcc 

130801 tgtttccaaa ataccctgtt cttctcacga agactacgat catcgtggga tgtattagtt 

130861 ttgtatccgc tagagagtag agactagatc ctgattgcaa ggctttatga ttatgattct 

130921 cagtgttgtt tcttgttctg ttgttgaacc cgtcccgaaa tcggaaccca attccgtggg 

130981 ggatctcgac attgatagat cgcatcacgt ctttagccgt cgccggattt tgatgatagc 

131041 cccattgtgg ccaaactaac ggtgactgca gagcccgtgc cgacatgcac ggatcaaaaa 

131101 gggatagatt gaaatcagcc aaacaaaaac tgttgggcgc atgtatgagt gagaactgga 

131161 cctggggctg ggcttggctg gctgagacgc caaaatccgg acacacggcg atgccgaggc 

131221 ctcgactcac gacgaacatg caggccgtca ggcaggcagg caggcaggca gacaggctgg 

131281 aagacgagga atcgcgaccg gacggcgaac agggtccagg caggttagtg aggataggtt 

131341 cagcgagccg atacggtacc tcgttagtgc tccttagttc tgaccagctt gctagccttc 

131401 ccgcggatgg taacctccgg cgggggtccg gcatccaccc gtgggattta cttggggggt 

131461 ttttttgtac ctctcgtgtc gatcgcagct atctcctccg aggttcgtgg ccaatcgttt 

131521 gtatgcagga gatgcagtag tgtcgacggg ctaggacgag ttggttaaca tctcctgggt 

131581 ggtggattgg tctcgccgag tagtaagtag tatctagtag tagtaagtat ctttctagta 

131641 ctagccgttg ttttcatact aacgagttgt tggcgatgag gataggagtg ggggagggga 

131701 gagatattat cccactgtgt gctatttttt attttcttct tgttcatcac caggtgccac 

131761 cttgatggtg tacaccatcg caacagctat aatgatgttc ttttgggggt ggttatttcc 

131821 acaaccccca ctcatacgca tacacccatc aacctattca ctaaattcaa cagccgcgca 

131881 aaaagaggat agaagacaat aatcaaagac agaagagggg tagaaaggat gaaacctaat 

131941 ctcgataagc gccgccaagc cctgtaggtc aatctaactt gccctcctcc gtcagtcttt 

132001 tattccgacc aagaaaggaa aggcagcaaa gcgaaggcga atccggcccc ctccgtgcct 

132061 cgctcctcca agatgggaat atgctgcgct tggctgcgag tttgccgata ttacactgaa 



132121 cctttccacc cccgccgctc gtctcctgtg ccgagtactt ggctatacgc tgaatgacca 

132181 ttgtttgatg gtgtgctgcc gtgctgggtg cttttcttgg gttacagaga gagaagtctt 

132241 gggagaggtc ttggtgaaga ggagatggtc tgggcatagt ccgtccgcta ggtaagtaga 

132301 ggattactat aagatcgcta agttaagagt ggagcatgtg tatacgtatg tattcctctt 

132361 ttgaatgctg atgccaagac gttggcagga gaggagacaa aggagggggg gaggaggagg 

132421 aggaggggag ggtgtgtttc gcgcctaggc ctgacgacac acttggcttt taatctatat 

132481 tatcttcact acttggtgag atactacttc ctgtttatgc agttgggtct cggcgaagtg 

132541 gggtggggat gaatgatact agcactaggc agtaggatcc agttctcgtc actgctctct 

132601 ctctcccacc cactcagctt atggactgct agctaacact ttgcccagaa tcatttctac 

132661 ctatttggat cggattcggg agtctcggga ctccggtggt taaccagatg atggtgacga 

132721 ctaacgatgg tcacgacgat gatcatatcc cagttagaaa agacggtttg acgagaccca 

132781 gtccaaccga tccaaccgaa ctgaaatgca atgaaccctc actgaaccgg gactgatcat 

132841 atgccgatgg aacaatgatc gctgatgcca agctcttctg cagtggaaaa gggagatcca 

132901 cgcggacgga ggattatgca gtgccagtta ctagtagtta gtcctacctc ccaaacttgc 

132961 accgcgagta gtaggtgtac tactagtagg atggatataa tttctccgtc ttggtgaact 

133021 ttctgcacct gcagtctgca gtacatcctt catttactct ttccttcgct agttgttact 

133081 atcgctatct cgcctaaatt tcctgcaccc atcttatcaa atcaattacc atcatctgtc 

133141 tgcatgcatc tcataaatta atcccgtgtc tgggttttat ctatcagttg cttatatgcc 

133201 gagaaatata aatggcaaac agacaaggtc cgcggagcta ggcattcatc atcggggtca 

133261 actaggacta ggtagggaag tgaatggagg actatgacta ggtaggcacc cttggggaat 

133321 aatggactct actattgtca tgccattgtc tttccttagg ctctatgtat aatctgtctc 

133381 attaattgac tactacggaa ccgatgcctc gatctgtaat gtgcagaacg aggctggcaa 

133441 gtcagcgagg atagaataag ccgcgtgtta agccaagccg gtaaagagac cggcaaggag 

133501 cgggccggga tgagtgagac atcctctgaa gaaaaggaat agcgagaggg gggaattgtc 

133561 ggaggaattg aggattttcc tcgtgtatag cacgatcttc tctcactctt gctggagcct 

133621 ggcccacacg gaaaggaaga aaaagaggga aagaaagaaa agaaaaaaaa aaaaaaaaaa 

133681 aaaaaaaaaa aggccaacgt accgagcatc ataagagtcc tcactaatgt catgtgcttt 

133741 gatggatctt gttagataat ttcattgttt atcatgagta agaggatgaa ggaagcgtaa 

133801 gatgaataaa ctttgggatg tctctcacaa ggtagattaa ttgggggtct acgttatcgg 

133861 ggattgggat tgttgatctt catgtggttg gaccctgttt cattcctcct gcttgtgctt 

133921 ctgctctttg taattacatt gttaacgtat actactactg tagctcacgc agacaacaac 

133981 atccgtatag tacttcctcc ccatccacct catcaaagga gcaggatctc accaaaagga 

134041 tctaatcacg cacgatcaaa aataacagtc caccctccct cttcttctgc aactaccaaa 

134101 agtcctgcat aacttatacc cccatccatc atcataaccc atcacaaacc cccactcata 

134161 ccaaaagcac tgggaggatc cgatattatc ctaacgagcc atggatatga atcctgtcgg 

134221 ctccaacttc tcccgagggt aagaaatcca gccttgtcgt cagagagccc aacacgaagt 

134281 acgcaaccct cgtctatatc tgatccaatt atttaaactt ttcgtttttc atttgacatt 

134341 tttcaatata gcaaacaata tagccactag gctagactcc ggagtgtggg aacggagtcg 

134401 gaatcggaag gatatggtca tcttcacata cttttgttac tggattacta gatcttcggg 

134461 ggatattccg cgcttaggtt accgtgtggg tgccttcaag aggtcagaaa taatcttttg 

134521 attgctgatc aagacgtttc cagtggaagt gcttacctaa ttacgttgca tctagcgtaa 

134581 cgtcttgagc atgtctccat tctggacact tatttgtcgc ataggtgagg ctatactact 

134641 cataagtagt agggatgagt atatctgcac ctgcgattat ttacctgaaa gagaaagcgc 

134701 cgtccatggg atattattac tggtcaatac aatctatctg ccactcacag ggcgggaaac 

134761 aaacaaggct gtatcttcct ggtgtggtaa aatcaatcca cccgcccatt tactacgaag 

134821 atcttcccaa cgttgccgat gattacctcg aaggtgcatc cttcccacac cataaacctt 

134881 cccatgtctg cccagaaagc ttctataatc tttttgtcca atacaaactc gatgaaaccg 

134941 gaagaaaaga ggggtggcgg ccaatggaca gggacaagaa ccagatacgt gccgtgtcta 

135001 gctagtgatg ataccatctg cggctgctac cactcgtttt gtgacctccc ttggctttac 

135061 tgtgggcgct tgtcgccact tgttactcat actttgccaa tggagagttt aggacggcag 

135121 tgctacttgt tacgagtgtg tgagggacaa ccgcggatat gactctctcg gaaagataag 

135181 tgactcactg agtttgatgg agaaaaggat agttgatcca tttgaccgaa acactcccag 

135241 aggagttgaa ataatcatcg aaggaaaggc aatgctattc attggtacgt tcatctctct 

135301 tttccacatc cttatcacac tgtatgacag tacattcgcc tcgtgaacta actccccaga 

135361 ttaatcaaaa aagatacgga cgaaatcacc caacatgcga taatcccctt cagacaccat 

135421 ccctcattac aacaattctg agtcctcctt caaacacaca ttatactacc ccggaatgcc 

135481 gtcatcgtca ttccaatttg ctctatcctt gaatgctgct aaagtgcatc cagacaacac 

135541 tcccaacccg gtcattacgc cactatcgtc ctcagtggcg ctccaagagg gataacgctg 

135601 ataagtaaac tcattaacaa ctatcgttgt tcatgcctgc atgcaacatt atccaactgc 

135661 gtccccgagg tggacctcat tgccgaagta tggatatcag gtataatcca tgacccttca 

135721 atgcgggaag gacaactgtc caataccttt tgccttcttg gatagagaaa aattggatat 

135781 aagatacaat ttgtgtctag tccttgaata tcaagtagca atatgacttc tcatcgtccc 



135841 attgttgaca aagctagagc agttgactgt acttcaatag aacagcgaac atatacttcc 

135901 agcccacttg tgaaataaaa tgctgtcgtg ttatggggta atgtagaatg tgacattcca 

135961 atacagtgtc gaaattcgtg ttcacggtgc ggaggcaggg catacagaac aagtaagagt 

136021 atctttctta acgcttgacc atcgtacatt gtgcaaacaa ctccttcgca ttctccatcc 

136081 ttgcatccat aaaatcatgt gcatctgtgc aaccgctgca cttgctacca gagttcctag 

136141 cgggacagct tctcttaatg ccatagaggt aatgttgcac gaggtacctc tgcaaacata 

136201 gtgtaaggat atataccggc aattggcagt acaactacat gctgagcatg ggtaagttcg 

136261 tgaattagac caatgggtag gtgaatgaca ttgcatcgag gtttagtacc actgggaaca 

136321 ataggcttct tccttgtgag agtgtcagaa tagtatatcc cagccttcgg gaacaagaaa 

136381 gatcggtatt ctgctgaaag agacccgata ttccaattgc aatctaatcc atgaatttaa 

136441 tcgacatgta agttgccgtg aatctttcat tggcatatta tgttgttcgg gatgtccacg 

136501 gagattggtc ggtttttctc agggttagta caatcatatc tgttctttct ttcttctccc 

136561 ccttgatcaa gatccatcat cagagtatcc ttgcataacc ccgtagctgg caactatatg 

136621 atttcgcttt tagcctttct cccttcttgt ggtattcccg ctgcttttgt ttgcattgtg 

136681 ccagctgcag atggtctttc ggcttcatta ctggacatca acatcaatca tgctcctcaa 

136741 ttagcggtag gttttttgtt ttggttatag ccggaagatc atgtgtatat tggtcaatta 

136801 tctctctctc cttggtagat aggccattaa tatgtccgat gtgtcatgct tagtctttcg 

136861 ttcgtggtca agaagacagt ttcatctggg aaacaagtca atttgcgaag gtgagcctgt 

136921 caagcgacgc gcaagcatag acaacgagac agtaggcgac gccgcacatg cgtagcatga 

136981 aaagtctctc tatgccgagg agcgaacgac taactgagaa cagcgataag cattttcagg 

137041 agctgctcga ggcagactga tatagtcctt ttcccgagga gccctggcag attgaccatt 

137101 cttatatata aggatatcag tgctcttcga cggtactcgg taaatcatga gaaatcatag 

137161 catggccgcc attcttgcat gaaatcaatg gaacatgacc tgaacttcac caccgtagag 

137221 ctgggaagcg gcatcatgcg ctgatctcac gcacgagtat cttcgcatag cgcgcgagag 

137281 aacctcgatg aaggttaggc ttgtgaactg catgtaaaag ggaaccatca ccttgatggc 

137341 caggtagacg cagtataccc ttctgtccct ctatgttgag gattcgataa ctgcgattga 

137401 tgacgaacca cctcaaaggt ggatgtcagt atagcatata tcttgtaaag ctggctatct 

137461 cttgcagaga caacgaagta ggttcgtatg attatcatac cctccgtgga ctctgaatgg 

137521 ccctttccat agtcttgtcg cccattgtaa gtggaaatgt tcacatgtct ggaaactttc 

137581 aaattgttcg tgcgacggta agatgtgctt gctgtatgct gtgtctcatg acatgtagac 

137641 cgccatgatc tgtccattca cgatgtgtca tcaacaggcc ccagtcgcgc tctgaaagac 

137701 ttttagctcc accaagctgt gtctaatatt atatcaccct gggtaagatg cagcctgtcg 

137761 gatattcctg tagagagtag gtagatccat ttaataggta ggtggcagcc aaagagtctc 

137821 tcactgctgc tcaacgtttt caatgcttgt accgggactg ggggctcttc ccttgcatct 

137881 ccgaagtaca caagaatctt caacttcgat gtgaacttct taatctcact agcccactgc 

137941 cacacgagat gtggtggaca gatgattagg gttggcttat ccggttcata tgtggcatgc 

138001 taagatagat attagcggtg ggaaagagga gatctgggga agttttcctc agggatcata 

138061 aaaaaatagg gacatacatc cactaaataa ccagtaagag tccaagtttt tccacggccc 

138121 atgacatctg ccaacacaca acaacagagc ataggattct tagaaaactc cttgatagca 

138181 tagatggcaa caggctgcca gaagcgtaaa attgtagaca ctcgcattcc cttaatcttt 

138241 ggttggttga tgtctctgat attgcagggc cctgcacgct tcctcttgat ttttcccgct 

138301 ggaattcgag tgcatcaatt taagagcgat gcagatggcg attgcaacat gatgttcaag 

138361 ccagatgtgt cagatcaagg ctgttcatgt tcgacaatgt tcttcggctt tgctgttcat 

138421 ttctgtctcc atttacctgc gcccagatgt tcacattgtc taggccaaga acctgtgtgc 

138481 catgatccca gaaacatggc agacatcatg taaaagtcgt atctggccat agtttccccg 

138541 cacttccata tccagaccgg aatcgtaagc gtagctttat ctgatatacg aaatttcctc 

138601 ttcatataca gtgtcatgtt cagtcatgtg tatgttgacc cttcatgtga tagaagacag 

138661 tggaactggc gggaccatat ctcccactga tccggtaaac cacaattgat tctgcacgct 

138721 gtgtggcttt tgctcgacct gttatataca ccgaccgatg ggttgatagg catgatcgat 

138781 cgcggcgaat ggcgcgggat tgtggtgtgg tctttcgaaa accacaagga atgcgcaatt 

138841 cttctagcca catgggggta ggatatgtgg aacgtatttt gagctaatat cgaaccatct 

138901 tagttcaact gaggtagtac tgcaaatcag agtgacaaag ttgccaaacc agtgcccatt 

138961 ccttcactca tatgtccaag cactctctca ttgtacatct cgacatccgc aaccaggcac 

139021 ccggttacgg atgtcattga cgctccacag attatgccaa tatggacctg acattagtat 

139081 aggacgttcc tgtttgaccg agaggatagt atctatatat catgccatat ttcgctgtaa 

139141 agggtaaagg taatagcttc gtcgatgagt ctcaggggtt agctcccgtc ccatcagcgt 

139201 atcgttctga taaatgtgat atgatgcacg ccattctgga tctaaaccaa cggtttctgt 

139261 cctcggcgca atatatatag aagtacgagt ttcttccttt cggttgttct cttccgatgg 

139321 actcctagtt gcatactagg agtacttgta gtactcatca acccagacca agcacacggg 

139381 ctgtcagggt tccatcatgg ttccacagaa catcccacga ggctgtggtt cattttgaca 

139441 atgtctgagt tctttcatgc tttgtctcca ttgtcaaatt ggcattgttc ggcgtcgagt 

139501 cgttcctttt gcggatgcgt ttgttttctt cgaccgttgg ctagaattaa agtttaaact 



139561 tgcctcctat gccaaagatc cattggagat tttcagctgc gtggtatatg cttgagccga 

139621 acatatattg tcaccgaatt tcgattagga atataatata acggtaacag ccgcgacaag 

139681 cacggtatct gggagtttcc gaagcctagg ggcccaccct attctgatcc tcaggcaata 

139741 tgatgcgctg caggggcata tactatatga cagttcgtgt taggaacctc caaaaggtca 

139801 attagttatc aggaggatct ggccaactca ccagcatgca ggcatgttac agcagagccc 

139861 ggagacatcc cacttcccca atcgacaacc accacacggc tcacaacgaa gagaaaggcg 

139921 atatcactct acccatagat gacatattca attaccagaa tacgtagttt cgtctccgag 

139981 cttaaattct ggctgtgacg tagagaaagg ctatacttgc atcctctaac cctagtcctc 

140041 attggagtcg ctaggcatta tggcgagaat tcggggcagg ttttcttgcg gagacagttc 

140101 catagggcac gcaaagaaag gtatataatg gatgtgtcaa gtcatgatgc ccttgagtgg 

140161 gtttttgtgc gtgattgtct cttatatata ggaaatggga tgatggagag aagagcagaa 

140221 agatgggatg gatgtgggag gaaggtcggg tttctataag ggctatttct acgggggtga 

140281 attaattaaa taaatgtgcg agcaagatgt taattatctg tggttgaata tcacagttga 

140341 attggtgtct tggcggaatg gaatggaagg attcaggggt gaatctgcca agtcaatgta 

140401 tgcactagcg agatggatca ttaggcttca tcgtttcatg gctcggcatc tatattagtt 

140461 ctttcgttgt tggcgataaa ggcgggaatc attcacaggc agtcattgtt gcatgaatag 

140521 aaattcatta gaaaaggaaa tattaaagag tgtagtctat cgctaaacca tagttcgggc 

140581 gaagtcatta accgaacacc atacagccag aaaaacacaa agccgcccac ccagggtata 

140641 caatcattag tgaaaactgc catgatggtg gctttcatga ctatcaccat cggccaccat 

140701 ctctatcgtc tgtgacacgt ccaccgtctt gtgaatacct ccagtgtcat taccgcctgc 

140761 catatcgacc gtccctgcaa atgtaatcag atgtgactgg tcggaggcgg actcggtgcc 

140821 atcggcgcga cgggaattac cgagggagaa tcgcttgttg cttcctttga gtttgctaag 

140881 gttgacagag gactccggtt cgtaatagcg ttgagagctg tactccttct tgcggagttt 

140941 gttgatgagg atgttgaccg tgggcagaca agcacagatc atggccacgc ctccttcggc 

141001 gtttctaatt ggggggttag tctgtgacgt tgattattcc caaaaacaac ttaccctgac 

141061 agaatcacga tcataaagac aatagtctgg tccctggaac taccgtcacg caagaccaga 

141121 accagccggt acaaactgaa ggccgtggcc agaccacccg ccgacaacat gccgatgacc 

141181 cgcaacttct tattccggga catctgcagc gaccaggtga ggggaagagg taaaatcaag 

141241 atgataaagt cggtcaccac actgatgaca gagtcggcaa tcagggcggc ccgttgattg 

141301 agacaacgtg cctttggatc ctgcgtccag tatgccggaa ccgggtcgca catgcgaatc 

141361 ttgacgatct cggcgacaat gtagtatatg cagagacagc caagaaggcc gtagatgaag 

141421 taaattttgg aacggtaagg attgaagatg cgaatgagga tagaaaggag tgcgatcttg 

141481 acgaagagtg ccatggggca gtaaagaacg gtggcgatgt aacagaactg gtttgatcct 

141541 caattagctg atgctactta tcccccttct ggtaactgag ctttcatacc ttgcgaaaga 

141601 gaacctggtg tgcttcagag acatcatcga tgtggtaacc cccgccatag tgacccacta 

141661 gagtcaagtt agttggagct gctggagagg taaaagcgat acataccggc aatggcaatg 

141721 ccacagtatc ccataaacag aacctgcaat tgtcagtaaa tcgccctatg aagggacctg 

141781 cagtctcaaa cgcacccatg ctgtaagaca tgaccctggg agagttagca gtgttttcta 

141841 gaaagagaag tcaagcagac atacagtctt ccacgcctac gaactgtttg tagtaaatac 

141901 gaatagagaa acgcaagaga acgaagatag taatgatcgg tatgcacaaa cattgtgtga 

141961 cgaaattgat ggtgttcaaa acatctttgg gatgactgac agtcatggcg tgccagtggt 

142021 caatgcagcc tcttcaagca ggaagaaaca acatggagca agacactgac cccgacgacc 

142081 gtggggagtc cacaaacatt tataaatagg ttcaagttag aggcgcttgg tctgtcaatg 

142141 cgcacagtgt cgtcgcccgg ggtcaagctg gcgctgcgcc gtctatgata ggcccatgaa 

142201 gtcgcctaga gaggctctca actttgggtc ggaagctggg agtaacaatc cgagaaatga 

142261 cttcaatggc actgggtcac gatcgtcccg cccaggtgta gatagaaaaa ccttaaaggg 

142321 atacagcaac tccccgatgc agtcggactg ggaggcgacc atggagtatc cgagcggatc 

142381 gatcgtgata ggccccagag gtcataacgt ctacctgttt tgggagatga tggacaggga 

142441 gaagcaaaga tgcaccttgc attggatcgg cgtggctgct tgccactggg gcgtcgcacg 

142501 cgatcgtggt ctagtctaga tatgtagtat cgtgccaaat acactcagtg gtgttgcggc 

142561 ggcggcgtcc gtggaccatg gttctgaaaa attgacaccc gtaggcggat cccagtagct 

142621 aagagtgtct catttcgacg ccgaaactgg ggttcgacgg gggtgtccat ctgcaatgcc 

142681 aatatttttc ataattacta ttggataatt gattctcaaa tccctcctcc cctccgggct 

142741 agccgcggag acatggctgc agggcgtcgg aaatatctcg aacgaacaat ggagataacc 

142801 aggagtaata accatcgccg attattgctt cctccccggg cgctcgttcc aatagctcct 

142861 cgcactacct atgcaccaga tcctggctgc atataggctg tggcaaccac gcactatctg 

142921 tcatggtgct gcagaatcac ccgccaagga cccgagtgag ggattagtga tcgccgtgtc 

142981 cgagccaaac cgcaataggg tattcggcat aaaggagaac gctggaaagt gggttgggaa 

143041 gcagcgtctg tgaaaagcct gtgatgccgc gggctccggc aaactgtcat ggagccagtc 

143101 aaccctggca gaatggatcc ccttaattca caagcttgac cggctcgttg ttggccttga 

143161 cgggggttgt tcttccggaa gaccgacttg cgagaccgac ccgttgttta gcgtgtgccc 

143221 taaaagtctc taaagtctct aacgtaaacg ccaagcgccg cgatgaagaa ttttgacttt 



143281 cgcttgctgt gtgtcgagtg gagcgtcgga tggcttccat ccctattatt tcctggccga 

143341 tatgatggct cacattgacc tcgcattgtt cccctccccc gtgtgatctg agaagaagaa 

143401 attgccccca ggttgttgtt acctttgacg aaggtggctg ctattgaaac tctcctgagc 

143461 ttctagaaga cgccttccct gcgcccgatc agcaaggctg cagaaaggct tgaaacccac 

143521 gcggtttgcc acgcgctgca tgttgtacaa ggggtcaaca gggaaaaatg aaacccgagg 

143581 taaccgatga ctaaactgga gttgggtgta atctgtggtt ctgtgtaggc ttcgtggata 

143641 cacctccact tcccaaggtt cagcccgtcc ctcccatcca atttctcgtg tcctcgacac 

143701 tctggttggg ttccagcaac ctgtcaggct ccatctgtca ccctcatttg gcaatgggtc 

143761 tgctgctctc cgttccagtc cactgagaag agggaggagg caatgaagcg agcaatagtg 

143821 agaagtagcc ctcatggtga acaccggttt gctgtcgcac cacggcccgt gtcaagccag 

143881 aaagttcctc ctccattgtc cacttgtggc tccctcgttt gtctttacct tcccaggtag 

143941 ctttcccata aattgcacct ttacgagaat cctcacaatg accattgtcc aggcccaagg 

144001 agcggattct aagctctttc agccgcttgc cattgccaat ggcaagctca ctctgagcca 

144061 ccgggtggta cacgcccccc tgacgcgcaa ccggggcgaa ccgttgaact caaacagtac 

144121 tcccgagaat cccaatcgga tttggtaccc tggggatctg gttgtcgaat attatcgtca 

144181 acgggcaacc cctggcgggc tgattatctc ggaaggcatc cctccatcgc tggaggtaaa 

144241 tattccccaa ataccagctc caatgggtca ttcgtctgtt ctttaacacc aaagcagagc 

144301 aatggaatgc caggagtctc cggtctcttc acggaagagc aggccgccgg atggaagcgt 

144361 gtcgtcgata cagtacacgc ccaagggggc tacatctact gtcagctctg gcacgccggc 

144421 cgtgctacag tgccccagat gaccggctac cctcccgtct cagcctccgc cagcgtctgg 

144481 gacgaccccg aagagcgcta cacccatccc gctgtgggag actcggagcc cgttcgctac 

144541 tcagaccacc cgccgattga attgacggtc gcgcacatca agcaaaccat ccaggactat 

144601 tgtaaagccg ccaagactgc catggatatt ggcttcgacg gcgttgaact acattccggt 

144661 aacggatacc ttccggagca gttcttgagc tcaaacatca accgacggac ggacgactat 

144721 ggtggcaccc ccgagaagcg ctgccggttc gtgctggagt tgatggacga actggctcag 

144781 acggttggcc aggagaatct ggcgatccga ctcacgccgt ttggactgta caaccaagcg 

144841 cgtggtgaac agcgcgtgga gacctggacg tatctgtgcg agtcgctgaa acaggcgcac 

144901 ccgcatctgt cctacgtcag ctttgtagag cccgtaagca taagatttat aaccggctga 

144961 atcccactta ctaattcatc cagcgctacg agcaaattca cagttacgaa gaaaaggatg 

145021 ccttcctccg cagctggggc ctttcgagtg tcgacctctc ctccttccgc aagatcttcg 

145081 gctccactcc cttcttttcg gctggcggct gggaccagac caactcctgg ggtgtcctgg 

145141 aggctggcaa gtacgatgct ttgctatatg ggcgatactt cacgagcaac ccggacttgg 

145201 tggaaaggct gcgcaagggc attccctttg cgccttatga tcgcactcga ttctatggac 

145261 cctttgagga tagcgcattc cactatgtgg actacgagcc tgctccgcag aactcgaccg 

145321 ggcaggagtc cgtaaacgtc tcggtccgac tatgattgat gatttgtata tgcacataat 

145381 tgcgatgctt gtttagcttg attgaatttt agtttgcact tggcccatcc gtcaaagagg 

145441 gccgatctac gtcggtatta attcagtgct gtacgcaggt agatcaagaa gccacagagc 

145501 tcaaggcgct caagccgtgt aaccaccatc cacaaccaag gcggagccta ccatgtagct 

145561 gctcatcggg gaggccagaa acaggactgc atcggcgatc tcctccggtt gtgccctacg 

145621 gccaatggcg actcgctgtg aatctggcac catgtatccc gcatcccaat atgtgcggat 

145681 catatcggtg tccgtgtatc tggatcgtta gctaactagt ggatcttatg ctgaccaacg 

145741 ggtcacctac ccaggacaga tagcgttgat gcgtatctcc tgttgaatat acgctttggc 

145801 gtcctaatcc ggcttagtct cgcagtctgg gggaatcatc atgaacaaca gaccattttt 

145861 gagaaagcaa ccacagctgc aaccattagc catctaggta cattcatcca ccaatcccag 

145921 gcatccactt accatgcttg gaggctgtat atgccgacag tcctgaccaa cctcccggag 

145981 ccactacccc aaacatagaa gccacattca cgatcgtccc ccgaccgaga cgcaaaccgc 

146041 gcgatctgca atacgcttag ccggctatct tcccctcaag tggaccagtg aaacatactc 

146101 ctgtgtcagc atctggcgta tcactgcctt ttggcagagc agcaccccgg tctggttaat 

146161 atcgatcacc ctctgccatt ctgctgccgt cagctcatgg gtcgcggacg gctgccccac 

146221 aattccggcg cagtgaaccg agatatcgat acgaccgaat cgctcgacag tgcgacgcac 

146281 ggcttcatcc acgcaggcct ccttggagac gtctccaacc gtgagttcca ctgcaacttg 

146341 gggatgtgag gtagcgagct cgtctttagt gcgttggagg agtgactcgt ttacatccag 

146401 aagagatact gcttcgattc cattttgagc aagcgcgaag gcggtggtct tgccaatgcc 

146461 tgaagcctct gagcacattt tcccgatcgg gctaggtcat catactattt gaggaaactt 

146521 ttacttacca tttccacccc cggtgataaa cgccgttcct gatactttgc aggccatgat 

146581 ggatgtttgc ggtaattaat actccctatt aaatgagatg tgaaaggagt tagatttcgt 

146641 cgggcgctgg cgagtgctgg acgaccatat aactagcttt cgcccctgtc agtattcgag 

146701 cggggtcaag gcattgtctg tctggctggc tgcctcggtt attcaggatc caggccaatg 

146761 gaccagcact aagtagtatt atcctttgtg gcatagatat gtcaagtctc tgcattgtac 

146821 ctttatcttg gtgtattcac gtctctacat gctgttgact gtagtctctc tcgtcgatga 

146881 agcgctattt gctactagta tgactactag tatttttcgc cctcattagc ttcccatttg 

146941 accggctatg gctttgcggg gaaactgcct gagcatgtta gcttatgcat ctcttgaata 



147001 tgatcatcgt tcttcacgca aacttaccga tctgcttaga tgtacacagg actgcttaca 

147061 caaaaggtcc gagaacagta aagctccttt ttttctgcag gtcagggaag gtgcacgaga 

147121 ggttatgcca gccgatcata tataaggaaa cgcagattgt attccatgca tacatcataa 

147181 ctcataaaat gattcataac caatccatat ccaaagccaa acaacaaaat acacccaaca 

147241 aaaacaatac aggtatcaag tcacgtagaa gcgtatccaa cgaacagcca aataaacaag 

147301 tataacacct ataagcccct aaccaacatt agcacttcca tccatacata aagaccctta 

147361 ttcaatgaac aataaactta catcaaaccc aagacaatcc taactaaacc acctccgatc 

147421 ccggatgctg ctgctgcacc ggataaacca ccggctccgg cttcttcatc atcgattccg 

147481 agtatccata ttctccttcg ttcatcgcca cgggctgctg ccccacagca taagcatgat 

147541 tatagtaagg cggggccggc gtgcccggct gttgctgttg ttgttgcggt tgcatctgtg 

147601 tatgtggata catgtactgc tgctggaact gttgccggta gtattgggca cggagattat 

147661 cttcgcggcg ttcgcgcttg cggtcgcgac gatgcttgat gaggaagtag atgcagacgc 

147721 agatgaagat aatgacgagg atggggataa tgatgatctt tccgatgatc ttgccgatac 

147781 tcatttttgg gagttggtat ggagctagtt gaggtggaaa tggaggagat agggtggtgg 

147841 tatggaactg tgatgggaat tgtctgtggg gagatataat agcagaggaa gatcggtatg 

147901 gaggtagtta agtgacagta gacagtatga gactttgaat agatagatgg atggatagat 

147961 attaggcagg tagtagccag gcaggcatag gacagagcta gacaaagtca gaaacaaatc 

148021 cagaaagcca agatcttccc ccacaagctt atatatctct atctgcctag caaagacaga 

148081 ctggaccctc cagccctacc ataggttggc aggtaggcag gcgcgttagg aaaaatcaat 

148141 ccccagcacg gggggtgatg agctacactc ccaaagggag gacgttgctg ctgagcgctt 

148201 catggggagt gctgattcgg tgggtagaag caggacggca gcgtgaatat gcaggcgggt 

148261 tcacctctgt cacacgcagc cttacgcgtt aggttagctc ttggtgcatc cttgataaaa 

148321 gcttggttga tggacatgac actgatggtt ggaggagagt tgatgatcgg ggcttgggag 

148381 ggagccaatc atggaggcgg gtttggcgcc gcgggtgacg gtcgcgcatc agcctggtgt 

148441 ctgtcgaatc ctggtagggc tggaggggag gagaaatagt agatgagtgc taaacgctac 

148501 agtggattgt gtgatttgaa tctattttta tttaagttcg catgagggtt ttgaggagtt 

148561 aagatactgc tactatagct taagagaccg cagtagaagc gatccgcccc cgaaatatac 

148621 aatgaggaac ccaaggtcat tgcgcagttg tattgaaata gggacggcaa aaaaaaaaac 

148681 gataaaagga accttgggca ctttatagcc ctacatggct aaagagatgc ttcaagagac 

148741 atattttacc atcatcaccg tattcgaaat gattgatctt gccccctagg ctctttcacc 

148801 ccattttcat tcttgcagtt actagtaaca catcctcgcg gtcgcccgga cctagctgac 

148861 ccttacagtg acagcaaaac agaaaatttc tctgatcacc atagaaatta caatcctaaa 

148921 aattattttc tggtactttc ccatcaaggg cttcggttag atcaaaatgg atcgtgattg 

148981 tcaggggtct tcaccacaaa tgttgacctg gaagaaccaa tcccgaaatt cccgggctcg 

149041 ccgcggagtc cgcggagagc acggcgggcg cgggcttcac ggaagcgaag gaggtataca 

149101 acatgaacag cattggagga ctgataaaaa tccgcgatcg cccgatcaat gaagaactca 

149161 caagctccat ccaaggagaa taatgtctgc tatgagcaca gctacaaggt aggacagata 

149221 tctatcgaga caagccatgc actgaatact aaaacaacta gccaccaatc cagcatgaag 

149281 caatccaagg atcgcagccc agaggggaca tccaacgccc acgaggatga gaaaccgaca 

149341 ttcgagtttg gaggcgcccc tggggttacc atgctgatgc ttgggtttcc ccttctcatg 

149401 tactacatgt tcataggcgc gaccctgtac gacggccacc tcccgttacc cgatgagaac 

149461 cagtctatca ccgacttcct ctcccacctc gttgacctcg cccacacgca tgcctacccg 

149521 aaccgcaagg catggataat atactggacc ttccttgtcc tcgaaggact cgggtacttg 

149581 tacctcccgg gcgtatacgg aaagggaaaa tgccttcctc atctgaatgg gaagcagctg 

149641 gattattact gctccgcggc gtcgtcgtgg tatgttacca tcgctgctgc gttggtcctc 

149701 cacttctctg gtattttccg tctgagtacg ctcgttgaag aattcggtcc gttgatgtcc 

149761 gtggctattt gctccggctt tctggtctca attattgcgt acgtgtctgc cttgctaaga 

149821 ggtgctcagc atcggatgac ggggtcccat gtttatgatt tctttatggg tgcggagctc 

149881 aaccctcgtt tgtttcagtg gcttgatatg aagatgttct tcgaggttcg cataccctgg 

149941 tatattctgt ttttgttgac gctcggaact gcgttgaagc agtgggaaga ctatgggttc 

150001 gtctctgggg aagtgtcctt tctgcttttg gcacacttct tgtatgcgaa tgcctgcgct 

150061 aagggggaag aactgatcat cactagctgg tatgttccta tatgcggcgg acgaatcgac 

150121 agtactgata ttgccacagg gatatgtact atgagaaatg gggattcatg ctcattttct 

150181 ggaatcttgc tggtgttccc atgagctact gccattgcac cctgtacctg gcgagccacg 

150241 atccgtcaac ttataagtgg aaccctatag cactgggcgt gctatttgtg atgtatattt 

150301 tcgcatactg ggtgtgggat acgtgcaaca gtcaaaagaa cctcttccga gctcaggaga 

150361 gaggaaagcc tgtgaaccgc aagacattcc ctcagctacc gtggaaatca gttaagaacc 

150421 ctgtctgcat caagacaaaa acaggagatt ccatcctgtg cagtgggtgg tgtaggtttt 

150481 actctttcct gtattgcgaa tcagggctga tgaagtatag atggaatggc acgtaaggtc 

150541 cattacagct gtgactggtt cttcgctttc tcctggggcc ttattactgg ctttaactcg 

150601 ccatttccct ggttctactc gtgcttcttc actgtcatga taatccatcg tgccctgcgg 

150661 gacattgaac gttgtcgcga aagatatgga gaagcttggc gcgagtatga gcggagggtc 



150721 ccgtacttgt tcattccggt aagtcgcctc gagttggtgt tgtgattcag ctaacatcag 

150781 aaaagtatgt cttctaagtc attggaataa gggtttgggt ctttcagttg ggcgtcctaa 

150841 tttagaatgc attggccaaa ccaagactca cttttattag cttattatcc tgttgacctg 

150901 cgtttgctat gctttaccag actgtatata ggattataat cttggataaa tataaccagt 

150961 cacatgtccc agtgtacctg ccttctttgt cgacgataga cagaattagc atatcgcaga 

151021 cagtgggaaa gttgtccaga ccggtcttcc attctttatg agcacttcaa agaatgaaag 

151081 gtatggaatt tagaagccac agccaacaat tagcaatgaa atgaagatat ggtgtattca 

151141 ttaaatcgaa tcgctattgg gatcgaagct tagaatatcc atgaccgatc gacttgtgcc 

151201 aataaagcca tttaggaaca tggatatgtc tgcctggttc atggtggaat caaatgtatc 

151261 ttcaggccat ggaacatctg gtagggcaat atttccgggc ccatcagttg tgggaataac 

151321 aagggatcca tcttgatgtt gttccaaccc agttagtgtc tgttgtgagt tcacgaggat 

151381 ctgggcagcc cggacaaaag cgtcacagat agtgtataag tgcattgacc ctcgtgaaag 

151441 gcctttaatt agttctaggg agctggcggt atcctgtagc agagcgagat ctgctgtact 

151501 ggcggtcgtg atggcgtgag taaagacgat gacgaagggt gtaaatgaag gataaagaag 

151561 tatcctgtcc ctttagtgta ctataccaat gcagattatc agtttgtctt accaatttac 

151621 gtattctgtt tgcttatgtg tctggcgacc gcggaagtag gtgaagcact tgagatgact 

151681 ctgtaaagca gcagtcgcag atgcatagca ctgggaagaa accgccatta cgtcgcgtgg 

151741 atgggtctgc gcgcggtaga tgacggtcaa tactgagtat gctataaaat ctgcacaggc 

151801 tgccatccca tgtagtgagt cggcatagag accccgacta acatcaatct tgtaaaagca 

151861 ttagctgact tgactgcctc atcagaagga atgcgtaccg caagaagctt gtcccgcact 

151921 gtcattaggt cggaggacag ccgctcaatt gatgctgatc gttgctcgtc atctgctcga 

151981 caagcagagg tagagtacag ctggtcatag actctaccct gaacattcgc gaattcgacg 

152041 atcacaagcc ccattaagtc ccacgggcga tacttgggat catttgaagg agcataaagg 

152101 tctgcatcga tgtcgtggtc ctgaaagtgg gatgtcacgc caatattgag agagatattc 

152161 ttgtcaatgg tgtaaagtgc ccagaacgca tggcgttttg cctcagcaat agcaaactca 

152221 tccttctcta aagtggaacg tcggtgtaag ccgatgctct ggcacatatt gaatgccaga 

152281 gaaaggaaag tccagcacct ctggacgtcg gtttcctctt gtgctttcag aacctgaatc 

152341 tgtcagaatg tgtatcacct ggaaagctca aacatacccc aagcaacagt gcctgtatct 

152401 tctccaaagt tggctcgaac atgatctcag gtttactcaa gccagcgaca aagtgtcgct 

152461 cacagagctc aaggcttaat tgaaggtcaa agtgtgccag gtcgggatct cctgcatgaa 

152521 gataatcgcg gatcacataa aaaagtatac catagaagag tgtgagcgag ccaggtggca 

152581 caggatctag cgggaaatat atcgactggc atagatgttc cacgcgggtg gagtctaacc 

152641 aagattgact gaccaggaag aaaggaggtt gagctatatt gcgtcagtga tagcaacgaa 

152701 ggggaatgga gatgtataca tttaaccttc ttcaccactg ctacgaccag atctacagga 

152761 agcaatgtct gatttagttg tgcaggtgat tgtgctccac tccgattcat ggcgtgattg 

152821 tgcctatcga ggctactttt tagagaggag agcgcagcag agacataatc cccaacctgt 

152881 ccatctcggg cagtattagc agcagaatat gctgcgtccc tggctacaag ggtctcatga 

152941 tggaaagatg actgtccctc aaacggagga gctgttgggc cggtggctat ggggtgagaa 

153001 tctggctgta ctgaagttga tggcttcgtt atctgatgga gtgcttgctg aatggttgct 

153061 aattgttctt ggattgccga gaacctttgc tcactaactt ctgtcagccc tgtaagcagt 

153121 gatgatttaa ataaacccac taatcgtcgg aatcggctct gcgtgtttcc gaagacctga 

153181 caggcgtacc tgtcggacca caggcgattc caagagcgct gcagctggag caagggacta 

153241 ctttatcaca cttaatctaa agaaatttcc atgagcatca atataataca aaattggccc 

153301 tcagggtatg ataccttccg aagccgacaa ttgtcgcact ggtgggattt agtacttgtt 

153361 atagatcaaa tggtggcgga cgaagaagcc gtaccgcgcg ggcccggaac gagttgcggg 

153421 cgatggaagg agttcgggca gccattgtag gtgaagccgc tagttatgac cgataattcg 

153481 cataatatat cgatacttct tgcctggaac gtgacaacag tggctgtacc ccggcaaaag 

153541 gcttgcgagt aataaagtgc caaatgacag gcaagggagc gagtaaggcc tccagtcatc 

153601 ttgggaaggc gccattcccg ttcaggctag atcacacccg tttgggttag cgagtcccgt 

153661 gacgcccact catctcacca tgaccaggcc tttcccgatc gggagggtct tgacgcttca 

153721 caaacaaacc tggattcgat ggtcacggag attctccgag gctaaactac taacacggga 

153781 actatcagta gaaatccagg aaagagcttt ccggcaaaag tcgcccttgc tatgatgctg 

153841 tcaaaccata caatttaccc cgaacggcaa caatagcctg aatacaaagt ttagcatgaa 

153901 gtcctccccc gcgttcgtag cccgacaatg cggggagatc ttgcagacag gccatgacgc 

153961 gcgtatatat gaatgaagaa gagggctggt attctaggtg tctttgcgat ggcatagcca 

154021 gaaggaaccc gccaccatga gaggcttgaa tctctttcaa ttaatactgg ctcttctgct 

154081 gtccatggta gcagccgagg atgggtacga tggctggctc agatatgctc ccgtgtcctg 

154141 cgatctgcat tgtcaacagg ctctaccgtc tcatgttgtg ttgttgaaca gcaccaaagg 

154201 aagcccaatc gagaccgccg gacgagaatt gaaagcggga ttccaatcga ttctttcgac 

154261 gaacttgaca tctcgtccat ttcaatgcaa tagctccgca tcaattctgg tggctactct 

154321 ggatgagtac cgccaaagat gccgggacat taatgtgcct gagctcgatc ccgatggctt 

154381 ctggttacaa tccgaagggg acacagttcg gatcttaggc aaggatgcca gaggagcctt 



154441 gtacggagca tacgaatatc tagctatggt ggcacagcga aatttttctc gtgtcgcgta 

154501 tgccacgagc ccacatgcgc cgatccgttg ggtgaatcaa tgggacaaca tggacggaag 

154561 cattgaacga ggatacggtg gtgcgtcgat attcttcaaa gatggcacgg tggtggaaga 

154621 catggctcct gttgagcaat atgctagact gcttgcatcc atccggataa atgcaattgt 

154681 cgtcaataat gtcaatgcga acgcaacact actgctaccc gagaatatga aaggcctgag 

154741 tcgtatagca gatgcatgtc gaccatacgg cgttcaaatc ggcatatcgc tgaactttgc 

154801 ttcacccgaa gccttgggtg gtttaaatac ttatgatcca ctcgatcctg gtgtcattgc 

154861 atggtggcag aatatcactg atagtctcta tacctatgta ccagacatgg ctgggtacct 

154921 cgtcaaagcc gactcagagg gccagccagg tcctgataca tacaatcgca cactctcaca 

154981 aggggcaaat ctgttcgctc gtgccctgca accgtatggg ggtgtgctta tgtaccgtgc 

155041 tttcgtctat gatgacaact tgaacgaatc agactggaag gctgatcgtg ctaaggcagc 

155101 agtggagtat ttcaaggacc tggacggtca attcgaggag aacgtcgtga tccaaataaa 

155161 gtacggcccg atcgacttcc aagtgcgcga gcctacctca ccactttttg ccaacctcta 

155221 ccacaccaac acagccatag agttggaggt tagtcaggag tacctggggc agcaatgcca 

155281 tttggtgtac ttacctccgc tctggaagac ggtcctggat ttcgacttac gcgtagacca 

155341 caagccatcg atggtccgcg atatcatatc cggtcagcgc ttcaacagaa cgctcggagg 

155401 ctgggcagct gtcgttaatg tgggcaccaa cagaacatgg ctgggtagcc accttgctat 

155461 gtccaatctg tacgcttacg gtcgtttggc gtggagtccg acagacgaat ctgagcagat 

155521 cctcgaagac tggactcgcc tcacatttgg acaaaatcac catgtcatta acactatctc 

155581 cgatatgtca atgacctcct ggcctgccta tgaaaactat accggcaacc tgggcataca 

155641 gacactgaca gacatcttgt atacccacta tggtccaaat ccagctacac aggataacaa 

155701 tggctggggt caatggacac gtgctgatca tgattctgtc ggtatggatc gcacaatatg 

155761 gaacggcacc ggctataccg gtcaataccc ggaggaggtt gctcgcgtat acgaatcact 

155821 agaaagcacg ccagatgacc tagtattgtg gtttcaccat gtaccatgga cgcaccgtct 

155881 gcactccgga gtgactgtta tccagcattt ctacaacgct cactatgcgg gcgcagaagc 

155941 tgcacacgga ttcgtcagac aatgggagtc tttggaaggg ctcattgacc gggagcgata 

156001 cgaagccatg cggtcgcgac tggtctacca ggcgggacac tccattgtct ggcgagatgc 

156061 tatcaataat ttctactaca acatgaccgg aatcccagat gtggctggac gtgtgggtca 

156121 ccatccgtgg cgcattgaag ccgagagtat gggattagat ggataccaga cgtatacggt 

156181 cagtccgttc gaggctgctt ctaatactac agcaattatt accacttcta attcaacgac 

156241 tgggacggcg agaactagca tcaaggcccc atcaggagta tacgacatag gggtgaacta 

156301 ctacgatctc tatggcggtc aatcgaagtg gacactatct gtgggtgata aggtagtggg 

156361 ccaatggctt ggggatatgg agcataattc cctaggccat acaccatcta tatacttggg 

156421 tggtcactcc gccactcgga ttacgtttca tggggtcggc attcggcagg gtgatcagct 

156481 gaagattgtt ggtgaggcga atggggtcga gcccgctccc ctggattata tagtgctact 

156541 gcccccaggc ttagttgact gatatctgta aggttctatg taccatgttg ttcaatgtat 

156601 tcgcagatgt aaccctcact gaatctgcat atatttggtg ttcttggcag gacttacaat 

156661 gcttggttct cgtgacgctg caagctatcc gacttcacaa ttttctgctc ttcaaacaaa 

156721 aatgaaagga taatttgtct ttccatgtcc aaaattgctc aaagtgcaag agtgtcttca 

156781 ttgcgaactg catagagtag atgtggccag catgcatcct taaccgcctt tatcctagcg 

156841 gttgacaggc gaacactata ctatcaacca ctatccgcac tctacaagag aaacaacgac 

156901 ttgcgcagga atctaatatt ttttttttga aattacacca aaccacaacg ccctagaggt 

156961 gtcatcttcg gataagtttc ttattaaagc ttttactatt ctttgttctt tagttttatt 

157021 aaattatcgc gataataata taaacatatc taactagccc agtttaagga atccgctgtg 

157081 gaaaggcccg atcgcgtgta aggtctcggg gtctcccaag tacttactga gacaagccaa 

157141 cggaaacgta aatatcacat gaccctatca cgaggacctg tgtcagaacg tgaagggctg 

157201 tgtgacaccg caggacatcc aacaactcat acaccattac gttattcgaa gggcaaaccc 

157261 ttcatttacc cgccaatcct atatatttat ttcgaccgtc tacgttcgtt ggcatcttca 

157321 acaaagatgg acggagccga gttgcaatca cccctactcg ataatggaca accccccctt 

157381 gtcgcgaagc gcgtaacttc tgaagagccg aatgctgccg ccgatgccaa aagactcaaa 

157441 acatctgatg atagggaggc accgccgcag acaaaccttc ccgcggcaac ccggattgtt 

157501 ccctttccag agaaagtatg ctcggtcctt gaaatgtcct gaaactctct gaacgtcttg 

157561 aagcctgcag ttgtagaaga acggaactgc gaaatagaat ttcgggttgt caacaacgat 

157621 ggttccacgg agagtactgt cattctaaca ggtctcaaga atctgtttca aaaacagcta 

157681 ccgaagatga ccaaagatta catcgcccgt ctcgtctatg atcacactca cttgtctttg 

157741 gccatctgca aaatgcccct ggaaattatt ggcggaatca cattcagaga ggtccgacat 

157801 cgcaggttcg ctgaaattgt gttctgtgcg gcttcatcgg accagcaggt gaagggatac 

157861 ggggcgcatc tcatggccca cttgaaggac tatgtcagag ccacatcccc agtgatgcat 

157921 tttctaactt acgctgataa ctacgcaacc ggttatttcc aaaaacaagg cttcacgaaa 

157981 gagatcacgc ttgacaagtc catctggatg ggttatatta aggattatga aggcggtaca 

158041 ctcatgcaat gttctatgct cccccgaata cggtatctgg aagttggtcg tatgcttctc 

158101 aagcagaagg catgtgttct agctaaatta cgccctttga gcagaaatca tattgtacat 



158161 cccccgcctc cacaatgggc caacggcatc gtcactctga tcgatccact ctctattcct 

158221 gctattcggg ctactggatg gtctccggac atggacgagc tagcacggca accacgacat 

158281 ggacctcatt ttaacgagct tcgacgtttt cttagcaaaa tacaagccca caaacaagcg 

158341 tggccctttc tctctcctgt caacaaagat gaggtcccgg attactacaa cttcatcgaa 

158401 tctccaatgg atctatcaac gatggaggaa agattggaga atgacacata ttctacccca 

158461 aaagagctta ttaacgatct caagtcaatc tttcgcaatt gtcggcagta taatgatgca 

158521 acgacgattt atgccaaaag cgcgacaaag ctggagaaat atatatattc tctaatcaag 

158581 gaggtaccag agtggtttga cttggtgaaa gagtgacacc aatgcattag gttctgctcg 

158641 agctggggca aagcttcctt gacacctcat ctacttagtt gcatcacgag ttagcactcc 

158701 cccaatcagg caaattattt ggctacagcg cttcgtcccg tatccaggaa ctttcgggtc 

158761 atacttcctt ttcgagatat tcgcttcttg ggcggccttc cggtacttgt cctaacagcc 

158821 caccgggctc aacttctcag ctgtgtccca tgtcgccggc tcttttttca ccagtgttgc 

158881 caaagccttg tgcatgcgat tggccaaatc gttattgaag gatcgccacg agcaataaat 

158941 taatatctct aagacagaag caataaccac actctgctgc gaaaacggat cgtgacgagc 

159001 gcgtagtgga cgctgccgct cttctggatg ggggtgaaac aggccacaac ttgcccagcg 

159061 gaacactcca atcagctgga ggactgccaa tatcttcatt gaactcgacc caatccaaca 

159121 ctaactgctt atccttaaac tatgatgaag ctgacttata ttgctggttt atagtactta 

159181 ctgctgtata aatatacttt acttttggta cttattttcc ttatagttta ctacagtaat 

159241 aatctttcta tatcatgata tacgttgctt tgtgatgaca tcttttacta aataataatc 

159301 agaaccatat tcttgtttca ggcagtgaaa ttagtaccta tatagcccat attatatata 

159361 gcaggattga tcaggcataa gaataattaa ttagaattaa attgaaaaag aaaaaactgc 

159421 actccacaat accagtgctc ggtcatctat tatgccgatg aaattattaa atctcccccg 

159481 catcaaacat cagcccatct gcgatgtcaa caaccggtta aattggcgtc tgatcgcagg 

159541 tcctccgcat tccccgccct gggagttgcg gatcaacaat gtggtagccg actgcccgct 

159601 ttatcttcaa tcagtaatgt caagggcagc gaggtataag aatgtgctaa ggtggagtag 

159661 ctctggggga gtttcgaagc tctactgagc caccgggctg catcttgatc tacaatgaag 

159721 acctcatttt tgcttgctat aggtctggct gtggaggctt gcctaggcct ggtttccgcc 

159781 ccgaattatg ttcgtcagat taatgccaca gactcttccc tccaagacat cgtgacgtgg 

159841 gatgagtact ccataagggt acgcggggag cgcattcttc tcttattggg agagttccat 

159901 ccattccgtc tcccttgtcc ggggctctgg ctcgatgtct tccaaaaagt ccgcgcgctt 

159961 gggttctcgg ctgtgtcctt ttacgtcgac tgggctcttc ttgagggtga gcgagggtcc 

160021 atccgggcgg atggagtttt cgcgctcgaa gagttctttc aggccgctac agaagccggt 

160081 ctgtatttga ctgctcgtcc tggtccttat attaatgctg aagtttccgg gggcggattt 

160141 cccggctggt tgaaacgggt gcaaggccgg ctcaagacta cggatcaagg ctacttggat 

160201 gccatcactc cctatatgca ggcaataggg cgcattattg cgaaggcaca gatcacgaat 

160261 ggtggcccgg tgatcctatt tcaacccgag aatgagtata cggcttgcgt acaagacgaa 

160321 ggctacacgc aagtcagcaa ctacagcatg cccgacatta atagttcgtg tcttcagaag 

160381 gagtatatgg catacgtgga agagcaatat cgaaaggctg gaatagtcgt gccgtttatc 

160441 gtcaatgatg ccgatcctat gggcaacttc gctcccggca cgggagttgg cgctgttgac 

160501 atttacagct tcgatgatta tcccttacaa tggagcacag cacgtatgta ttcgattata 

160561 ttgaatggac actgttaatc atctacgtct agcatccaat ccgtcaaatt ggtcatccct 

160621 gataagccca ttgctctcct acaatgaaac cgtccatgaa gaacagagcc cgaccacgcc 

160681 gttttccata tccgaatttc aaggtggagt gcctgatgct tggtatgaaa tgagtttgat 

160741 ctgtcctatt atgggaagca tcattaactc acaatgttct ttaggggcgg tgttggcatc 

160801 gagacttctg ctgcatacat cggtcccgaa ttcgaacgca tcttctacaa gataaattac 

160861 ggctttcgcg ctgccatcca gaatctttac atggtgagtt ggttgctatg tatgcttatg 

160921 ggcctttctc cctccttgct aatacagtgt ctgtgctgaa gatctttggg ggaacaaatt 

160981 ggggcaactt gggccattct ggtggttaca cctcgtacga tgtgggtgcg gccatcgccg 

161041 aagatcggca agtcatccgc gagaaataca gtgaattgaa gttacaatcc aacttcctac 

161101 aagcatcgtc tgcttatctc gagacacact cagataatgg gagctatggg atctacacag 

161161 acgccacctc cctcgcggtc acccggttag ccgggaatcc taccaatttc tatgtggtac 

161221 gccatggaga gttaacctcg cgagagtcga cttcatataa gctacgcgtg aatactagcg 

161281 cagggaacct ggcaattcct cagctaagtg gatccttaag tctacatggt cgagattcga 

161341 agatccactt ggtcgactac aacgtcggaa atgtcagtct gatctattcc acggccgagc 

161401 tcttcacatg gaagcaggca ggctccaagt cagtggttgt tctctacgga ggtgaagatg 

161461 agctccatga atttgctgtc cctgcaaata aaggcaaacc aacctctatt gaaggcgacg 

161521 gtttgcaggt tcagcagatc aactcgacta ctgtcataca atgggcagtc caaccgtcac 

161581 gccgagttgt gcatttcagc gatacccttg aggtgcatct actttggcgg aacgaggcat 

161641 acaactactg ggtcctcgac ctgcccgttc caggggctat aggccgacac gtctcccgat 

161701 cacataccaa ccgatctgtc attgtcaagg ccggatatct gctacgaact gccgaaataa 

161761 ttggcacgtc cctctatctg acaggcgaca tcaacaccac cactaccatc gagctcattt 

161821 ccgcccctca gccagtaact agcattctct tcaacaaaaa tcgtatcccg actaccatca 



161881 catcaccagg ccgcctcacc ggcaccctca cctaccacaa gcccaacatc agcctaccag 

161941 acctcacaac gctcgactgg tactacctca acaccctccc agaggtccac gatcccacat 

162001 acgacgacca cctctggact ccttgcacac acaccacaac cgcaaatcca cgcaacctaa 

162061 cgaccccaac atccctctac gccagtgact acggctacaa cggcggcacc ctcctctaca 

162121 gaggcacctt caccgcaacc ggcaatgaga cttccctcta cctcctcact gaaggcggct 

162181 acgcctacgg ccactccatc tggctcaaca acacattcct cgcctcctgg ccaggaaacc 

162241 cagcgttcct cctctccaac caaacaatca cattcccgtc ccccctcacc ccgggaacca 

162301 cctacaaact aacaatccta atcgaccacc tcggcaacga cgagaacttc cccgcgaacg 

162361 gcgaattcat gaaagatccg cgcggaatcc tagattacac tcttcacggc cgcgacgaca 

162421 aatcagctat atcgtggaaa atgacgggta actttggtgg agagagttac gctgatctaa 

162481 gtcgtggtcc actcaatgag ggggcgttgt tcgctgaacg aaagggatat catctccccg 

162541 gtgcgccgac ggagcagtgg acgaagaggt cgccgtttga tggactcccg gaggatgaga 

162601 ggcccggggt ggggttcttc gcgacgaaat tcgatttgca aatcccggat gggtatgatg 

162661 tgcctattag tgttgtgttt gagaatagta ctatggctgg tgatggatcg gggcctgcga 

162721 gattccggtc cgagcttttt gtgaatggat ggcagtttgg gaagtacggt atgttggcct 

162781 cccctaccgg tttctattat gactctcatt cttcctactt tggactctgt gagctaacgc 

162841 gttctgttta tcttagtcaa tcatatcggc ccacagctta gctatcccgt ccctgagggc 

162901 attctaaact acaatgggag caattacctt gctctgacca tctgggctat ggatgagaag 

162961 tcgttcaagt tggatggatt gcggctgcag gctaacgcag tggtgcagag tgggtatcgc 

163021 aagccaagtc ttgtgaaggg agaggtgtac aaggaacggg ttgatagcta ttgattctgt 

163081 aggctgcttt gatacttgtt agattctatc ctaggaataa ggtctgatct ggcacatata 

163141 ctgggaatca tttctagcag atactgatga attacacatg cgctcgaatc tcaaattata 

163201 acctttctta catcccgtca taggactcaa gatccactag ttgctaatct gtagacatac 

163261 ttggtcgcaa atccttttcc tttgacggcg tttctgcatt tttcgggggc ttcggagtcg 

163321 cgccaaagac acggcgcaac gtacaaggcg cataccgatt agcgacagtg acgtttataa 

163381 gcgtctctga gccattccca ctgacaaaag tgttagttgc aaccatgtga tgccttcggc 

163441 ttaaggcaag ggtacaagat ccatggttgc cgtattcgta agctgaagct gaggcgcagc 

163501 cgcggatgac ttgatcgact atttgcgaaa gagtcatatc aggcttcatc gtctgcatgc 

163561 agatccaacc atgtacattg aaggtcgatg tgctttcagg ccttctatgt ggtgagatgt 

163621 ggaatacaac tccaatggct ctcccggaag aattgtgtaa tctaacagca gcccatgttt 

163681 ccagagaagt cgagtcattg gagcttagtc gcagtctgtg attttcagca tcccagattc 

163741 gcgatgggta gtactccgtt atgctcagag tcgctgctct ggactcgtct tggtcgatgt 

163801 tgaattgaca tgtcagttgt ggtcctcggt cggggacgtt tggttgtgat cggatataga 

163861 gatctcgagt cgtggctctc gtttctagcc ctaagtcaac tctcactacc cgcccgcgcg 

163921 agatatcata atcgatatca aatgtgttgt ctgagcgggg cgtaagagga agggccaagg 

163981 cgcctgtgac atcatcctca tagtggcaat ccaggagcgc gaggcagcta tccttagacc 

164041 cggatcgata tatgacgggg agctggatgt ggaggcctcg gtttgtcatt gtgtatggcc 

164101 cactgttacg cggacaagtc cactggacga cgtttttgag aaagcaggtt gaggacggag 

164161 caaggactgt ataccggctc tcttgctcaa tgtcacaata gttccacgcg aagatcgtgt 

164221 gatcatcgga actcttgatg atctcctcct gaagtctgac aaatgaagct ttgccttcgc 

164281 catagagaat aggcatatga acatcgaaca gaccgaggag acaatacgct tcatcctcta 

164341 tcttcgaagt ttgcctccta gctgcccatg acatcttctg tgcgacacac actgatctca 

164401 gtgactcttt ggtgtctgtc gagtctctaa gaacacgctg gtctatgcca gtgatcgtag 

164461 agatgagggc tgtcattgat gcttttgtac caagaacgtt ccagccggac ccgtaaaagg 

164521 tcaagtcttt aggcgcgatc aactcctgca atgtccatcc tcgagtgaac cagcgacatc 

164581 ttgcaaaggg gccattgatc gtaccatcat atttactagg cctgcgcctt ctggcgggct 

164641 cgacggaact tccggcttgc tcatcgagcg gaacctcccc gtagccgcta ggaatgtcag 

164701 acaagtaaac gtagcacttt cttgatcgct tgtaccattg gaacatggag ttgatcgctt 

164761 cagatatttc cgcactgctg cttttgttga tgcaacatgt atcgacccaa gcgtagtcca 

164821 ggccgtccct gatagcttgc tgacagacat acttgatctt gaggaatccc tcccttccta 

164881 caacacgatc ccagtacctg gagcctgttg atagtgcctg aagatcctga aaagaaatct 

164941 catcctttcc ccatgtatgt gataggattg cgtattcggg gatttcactg tcgaggtatt 

165001 cttccaactg gtatgttttc gcgttgatga gtctcatctt tgcgctcaag ttcaggccaa 

165061 gaaaaatact tacaaagcta agtttgcatg gtctgtcctt cgctatttag gtggcgtggt 

165121 gatgtgttct tggcctacca gttggctgct acgtggcagg caacacatta agaaccaata 

165181 tgtccagctt tactgcattg ttggatactt tcccacactt cttcaatgcc aaaggctgta 

165241 tccttttctt atgatatgcc gcgtttcgag ttgcctcgtc gctcgcgtca tcagctggac 

165301 agcaaatggc caggttgccg ctgggaaggt gtcttctttg cctctgagtt gtctatggag 

165361 cctctctacc ttcttttgag gttcgagcag tctcgccaat cctggttggt attccatatg 

165421 tgaagggtat tacaagacct ggggtctcgt atttctttct caatagctga cgacgaggat 

165481 actttatatc gttgtcactc agcgcacgaa caccgaagtt agtgacgaca ataaggaagc 

165541 cactatatcg ttgctaggct cgaatatcgc aacggaacta ttagttgttc tcagaagcag 



165601 ggatgggact aatgttgtgg aagcatcaaa aagcagctca caagtacagg tatcctgtcc 

165661 attacatgta gacgcaagag ctcactcacg ttgcgaagcc gttcgatgag ttggtcagtt 

165721 ttgaaaacag ttttgacaag atggtgatga gatgacatgg atggatcagc tgaccggcgc 

165781 gtggtagtca ctactggacc acgtgtctcc gcttctcgag tgaggactat gagttcagcc 

165841 catctcaatc atcttattct tataccaacc cacatctgtg ctggcagtta ctcaccaaga 

165901 ctcagttatc cttaatatgt ccggccacgt tgcgattgtt gatgtggaat tagaaatgac 

165961 agagtgactg aaactgatgg ataatgtcga gaaactagta tatactgagc atactactgc 

166021 atcagtctat ggccagttat atatggcttg acagggctaa tctactactc tcaggttcag 

166081 caggctcttt caactcggcc tcgatgatgc tctgtccgtc cccgggtggt gcccagaatt 

166141 tcacaaccct gaagccagcc gaatccagaa ccttcgtcca ctggctagct gttcgctcca 

166201 tgctgttaca gaagatcaac atctcccagt cccacattgc tggcagcatc gcacagtctt 

166261 tgtctgccaa tacaaactcc tctatgatta acttggagta tcccattttc atggctgctg 

166321 cgagatgact caggatctga cgactttcct cgtctgacca gtcgtgaaga atgaatttca 

166381 tgtaataggt acgcgcacct gggaagattc cggttagttg tgggatgagt ctcgtctctg 

166441 gaaatatgcc caatgcttac cttggattgg ctgcgcctgg aataggtcaa atgatacatg 

166501 ttcgatcttt ggatgctggg tagcctcagc cacaacgtga ggcagatctt ccaagatgag 

166561 ccgcccttgt acttcggaga acttggactc gaaagcgacc aaatcatggc cgcgtccacc 

166621 tgctacgtct accaggagga cattactttc accctgatga aatccgtcta taatcctctc 

166681 cttgacagga aaccattcaa gccagctagg gcggctcccg cggtccgctt ccatgaaagt 

166741 attgcaagcg tccaagaaac ctggttttgt cggcacccag tcccaaagcg acatttctgt 

166801 cttctccgca tactgcaatg cgcctttcac agcacctgat gggttttgga agttggtcat 

166861 gcgcagaaac tcgggtacct tagccaaggt aggcatacca atatcaaagc tatcgttctg 

166921 tcagtggtgc agccacgcca cttgagtgac tgacttacat aaatctcacc acggccccac 

166981 taagcctgtc tgtcatttgt cttgtcattg ttgtgggaat ataaacccgt acatcatgct 

167041 ctgcaacgta cccaatccct gcaaggatac gcagcactct ttctgttccc gaaataacta 

167101 acttagaagc ttattcattg atgggattag gacaacatta ccagtcagta ctagatctgc 

167161 cccttttgtg gctgctaggt cctccagcgt aactggacca ttccgctcac tgagcgtgga 

167221 gaatatacca aggtccattg cgatgcgaac acacattaca tgtgtgggct ggtacgggtc 

167281 agtctttcat gctcattaat tttgcggaag cttattaccg agtaagctag attatacact 

167341 gcatcttgcg gcttttgaaa cgcctccacc agcttctgcg ccgcctggag ggcagccaag 

167401 cgatccgctt catcatgact attatgaaat gaatctgcag cctgcgcgac ctgtcgaata 

167461 agaccgtcga ccggcttttg agaggcgcca ttcgcgtgag aactattcgt aaacatgatg 

167521 aaagatatgc acgggtagca gcactacagg gaacagggga gaaggaatgc cctgcccttt 

167581 tacaatagtt gggatcctta tgtctctggt tcagctgagt agagctatag gacggctttt 

167641 tcctcgatat ggaaggaggc gcacgacagt gcgcacagga gtggatctgt cacctcgatc 

167701 agcgtagggg gtgagcgtct acctaatcca agatgcggta cttttggttg tcaacggcgg 

167761 cccgcaagac caggctagac caactagcag cactatcatg ttattagggt tggaggggcc 

167821 ctgtaaaggc ccgcccccgg ctgtcagttc cgtcacctag aggccgtctt cccacgcgcc 

167881 acacgctctg gctgcattag cgccgcgccg gtgcctcgcg gaaccgcgat ggatattccg 

167941 agccttgact cgggctctgc tttcactcct tcaatataac tgaaggtgac actacgccgc 

168001 ggcataagtc ctgaacgttg tcgagggtct ctggaggtca ttgcgcgcat ggatgcaata 

168061 ctataggctg cccaaaattt ggtccgcaca caccaggcac gtcgatctat tggctggacg 

168121 ggagatgcct gttgagggct gttatggccc ggttacttag caaaagcaca ggcctaaggg 

168181 gcttacaagt gggtcgtttt gacatatggc ggtcgaacct tttcctatgc tccgacggcc 

168241 aaatcacggc aaatcccgag acatggctac cccggacgcc atttgttgat agtcgatgtg 

168301 tcgctcgacg gctgaacaag gaatctcttt gttcctgtca actagtctat tgaattatca 

168361 aatatcgacc atatcgctcg atcgggaaca tgcctgatca aaaaaccttt atatttcaag 

168421 aatcctgtaa agagtgctcg tatcctccga gcagaaatga atgattgagc ccgaccacca 

168481 tgcctgccga caaaggccct cagataactg ctgtgtcatg gagctttggg ggcgtggcaa 

168541 tcatagtcgt caccattcga ctgtataccc ggctgatact gactcgcaaa gcaggctggg 

168601 acgacttctt cattgttctt tccctggtaa gctacccctg caagagcctc ctgtataata 

168661 catcggctaa cccagacggt gcatggatgc agttgagcgc tattgtatgt tcagggctag 

168721 cccaaaccgg cgtgcattat gggttgggga aacacatggc cgacatctcc aatgccgaat 

168781 ggaaaatcga ggcttttaag tacaccgtca tcgcaccaaa cttctccatg gtcagcacca 

168841 caacaggaaa gctatccgtg gccgtcttcc ttctacgcct gatgggtcag acggcttctc 

168901 cggcaaaacg atggttcctc tacatcttca gtataatatc tgtcgcctgg aatgtgctgg 

168961 ccattgttgc catcatgggg tattgccggc ctgcggagaa gatctggcgg cctgaggttc 

169021 ctggatcctg cttcagcttg aaattccagc tgatcgctgg tatatcgcaa gcctgtgagc 

169081 gtcaacactg cctattgacg acacattgag ctaagactgc cgcacagcat tcaacgcatt 

169141 cgccgacctt accttagcct tgtttccaat tatcatcttc tggtccgtcc agctgccctg 

169201 gaaaatgaag ctaggtgtta tagccgttat gggcgcggga atcttgtaag cacctccaat 

169261 acagccaaag aaactctact cattccaaaa agggcggcgg cagcaacctt ggttaaggcc 



169321 attctgctaa aaagtcttcc ggcccattca gacataactt gtgagcacca aattcgcggg 

169381 cgggcccaga cacataagaa aaagcttaac atagaactcc agggtcttgg gccgatatta 

169441 caacatggta ctcgtaagcc actcacatct ccatggttcg ttattagtaa accaaaacaa 

169501 aacaagctaa caccatcctc tccgcaccac tagaatcgag gtaagcagga accttcatct 

169561 cagcaaagcc tagctatact tttatcatgc gctaagtggc gcccatccgg ttcatcgaca 

169621 gatgtatgtg atcatcatct gcgccactct ccctaccctc cgacaatcat acaactttgc 

169681 cttgcatcga acacgctatc tgggttcata ctacaaaggc tcccactccg aagcagcagc 

169741 agcagcagca gcacgacaga agcctattcc tctccttcgg cgccaacccg atcagtcgct 

169801 tttcgagaca ttcgctgacg aaaccggtct ggtgaatgta gcctctcacg aagatagcca 

169861 cgtctcgcct gaaaggggtt caaatcatac gtccatcaca aagactacgg agattgaggt 

169921 ccacgaggag agcaagtcca aaggggatat cgagaatcct cacttcccgc gagcgaaccc 

169981 ctttaggaca gatcatcagt aggataggac ggaggagtga tccggggacg acggttcgta 

170041 tgttagctag atcgatattt cgtcaataga aacgggtttt tgaatgtacg actgtgctag 

170101 tgactgaatt attgcttttt agctgggatc ggaaataaat ctccacggga gattaggagg 

170161 gagaagcagg agccaatgca atcacaagtc tgaaggtcaa tcaaccattt atcaccactg 

170221 catcaattca acatccatct cgtctgcaaa aaaagttctt ctcatactca tctgtgttat 

170281 gatcagtcgg aattggctat tcacttgcta ttttcaacgc catctaatag acagtcaacc 

170341 tactctcaca gcaaaccctc ctgattgatt cccgagctca tggcatgtcc ccctaccgca 

170401 acctgcccta tcatatcttc atccctcgtc accaagagcg ctttcaaatc taaagaaaca 

170461 cgtgaaaatg gcagcgaaag cgaaagcgaa cgcaacagcc aacatatcgc caagtccact 

170521 ggaaacgcaa acgctgaact gcaacagacg ggcaaagtta aagagctctt tcaggactac 

170581 ggtcatggtc gatcgccaga atctatcacg gttggtgggg catatgcgcc catgagtggc 

170641 tactttttgc cgagcaactt gcttgaagat caaaatgggc cgaggacgcc gaaaataatt 

170701 gcggaagggc tagcggtacc ggctgggaaa gtgaaaggag atgactctga gcctgtgtcg 

170761 tcagccatgg ttacttcgag gatgaaattg aatcaggtgc tagggaatag aaggaaggtt 

170821 ggttgtgcag cgtcgcttcc tgctgggacg gctgcgcatc ctgttgaaca tggtcctctt 

170881 gctggaaggt cttctggggt tggtgtgtcc ttcggggctg agggggatcc cgctttcgag 

170941 ggttcatatg aggatgatta tgatgatgca tgtgccaacc acgctgcgag tgatggtgat 

171001 cgcgaagata aagaagcttt gaaaaatggc ggcaagactg ataatactgg agagtcagaa 

171061 taccccaggt tcagtcaagt tgattatttt acgcaacatc aaagcttaca aattcggaca 

171121 accgggcttc ccaagcgacc aagatattct gcggctactg ttggatttcc gtgtcctgtt 

171181 accgtatggg ctttcataga taggggtgat cccaagcatg ggatgccaac ggcagcagat 

171241 ttaccctggg agatccgaac attgcatgag acaaacgagc tgaacacgat tgcggcttat 

171301 cttgctagca tgcgacgaaa cctcaaagat ctagacgtca actttacctt tcactaccgc 

171361 ccagtgtcct ttattgacaa agcagatcgt gttaagaaaa ttgggggata ctcctcctct 

171421 gacaagcaaa atgctcatac cgtcgcagac ctgtatatgc atacaaagga cgatcggagt 

171481 tgggggacgc tctgcgttac gagggagcat catgtccagg gaatggcctg gcattactgg 

171541 gcagttgctg ctatctcacc acccaagtcg ccgttccagt cagtagatga gaaaggggta 

171601 taccttctca tgtacgactc aaggccgctc aaggaaccga ccaaaaagcg cgaccgggat 

171661 tactggagga actatatgcg tcgtgatcag tacaaattac tggtggagat agggaagaat 

171721 ttcaaggtcc ttgatctcgc aatcaatcgg aggcagctct cggtcgaggg tgaggacgat 

171781 cccctgcgtc tgaccctgtg gtggctttgg aatattgccc ggtacggcgg gggcttctat 

171841 gagagagacg gcgagctgga cgacccacgc tggaggttga cagatgccag atggctccac 

171901 tttgatcagg attggtcgag caatgtgcga acagcattgc ggtggaaaaa tgttaaagaa 

171961 tggttctcag agcaacggct ggatgagcag ctaacccacg agcaaaggat ccaagcacat 

172021 gcggtagctg cctagaggca atgacccgtt aatgtggcat aacaagaatg caattggacc 

172081 ttttctgtca ttctgcaaat ctaatattga gcttcggcaa cgcggttctg caggtggctc 

172141 agtttaacga gcctcgtgac ccacaagacc gtccatatga gggatagaga tggagcggac 

172201 ctagggtgtc aggcacttac ggccgatatg gtctggtccc gaaaccagtc gtctagttac 

172261 gccagttgta gcaacagagc cgagatgagg cggagtaatc ttgattcgtc cgggagtaac 

172321 ccgggaaaca tactcaaaga ggaaccagat cacctacaac ggagcaaaga atgcagtaac 

172381 aagccaatca tgatgctgct catcggcagc agtccgcgat ctggtcgagt taatctggcc 

172441 ttcgatcttt caacttcaat gggagatatc ccgaggttcg tttttgactt tcggaatgcg 

172501 gcgaattgtg attggagtag ttgccactgc cgcggactcg ttgacatcac tgcaaattcg 

172561 gccgacgagg gctcttgtgt tgctgcagcg ggccgattaa ctaggttctt ccgtctgtga 

172621 cttttcggag acgccctgag aatgcagaag aaagtatata accctgtata cataatgtta 

172681 catcccgtta ccttgcttgg ataaaattca taccaaccca gatgaatgtg tcgatgcaag 

172741 cccggctttg tgcgtcgcat cagcctccat gacgccactc aatcattggg gcatcaccct 

172801 cgttgttgtc cccattgtgg catgcgtcgt gacaactctc ttctttatcc tccgattata 

172861 ttctcgacga cttgtccgtc agaaactcga tctcggggat gtgttgatgg gtatagggct 

172921 tttcttctca tacggcctaa ccctctgtac tgtgatatgt gagtcacttc ataaccccaa 

172981 tttcaatgct aattagctta catgtcatct agcggcattc aacggcgttg gtcacgaatt 



173041 ctggtccctc tccaggccta ccagagggac ggtcactttg gtgagtatac caatctcccg 

173101 cagtgaggtc caagttctaa tcgaacgggg tagctctttt ggtgctcaac gaaattctgg 

173161 gtattaagcc acgttttcgt gaaattctcc tatatcgttc tgctccgcaa gcttttcgga 

173221 gccatcacat attggcggag gcttaccacc gcattgatcg tgttcacgct tgcctggggc 

173281 attgcttcca tctttgtcag catattccaa tgttggcctg tgcgttattt ctggatcaag 

173341 cacattgatg gtagctgtat gcaggggcga aataccttct acattgtggt tgggtcgatt 

173401 gcgctggctg agaatttcat cttggtctca atgcccttgg tcgtggtttg gggaatgaat 

173461 ctgtcgcccc gacagaaggt agaactaagt cttatctttg gattcggtgg attgtatgtc 

173521 tcccgctcta caactaatcg aactggttga atgaatggta ctgacagtga acagggtctg 

173581 cgccatcggc ctcttgcgta tggtcacatt caagagatac gtgacggcag atgctaccag 

173641 tatgttaatc catctaccct cgtggtcaaa tgacaagaag ttgctaatgc tctccagcaa 

173701 acggatatct ccaggctata tggtccatcg tcgaacttga gttgggcatt atctgtgctt 

173761 cggtgatatt gatgcgcccg atattccaac cgcctctcgg aatcattcaa aatgaaatgt 

173821 tgcaccatgc gtggccaaac atcaaagtcc agttactctt caaaaggcgg gcagacgatg 

173881 agttcgttgt catggccgaa gccgctggcc tttgcaggga cgggatcagt ggatgagtgg 

173941 tctcaagtgc gtgctgaagc atacagttcg tgtgtgcgcg gtagtgtgga tgataatccc 

174001 agtttagctg ggatccagga cataatggtc gaggtttcga ttgatcgcga gatgcgaacg 

174061 ggggaggaga tggtgtagca catgactccc atcatgtcag ctgataccct ggcgtactcg 

174121 atgcagtcat tatcttctct gtcaatagta atctactata cttggtacat atttttattc 

174181 tgatagatat tcttatgggg atccaaaggg ataggagtgc gcgctatagc cagtccagaa 

174241 tgagaaatga gaaggacttc agggaatatg atgggggatt ggtgccgatt tcgagctata 

174301 ttggagaatc cgcaactaaa ttcatctcct gtgagcaaat gtgagagatc tcggggcatt 

174361 actggtccta aagaaagata tccggtggag tgatatgttg tgctgcggca aagatccgac 

174421 tcattctcct cctcacgcgc agtttcgtcc gtggacaacc acaacgccag ctccacacca 

174481 tcccaggtta ttcagcacat gtcatttgtg ttccttgatc gctgacgtat cttttctttt 

174541 ttttcttact atctaggaat acgctgaata cgatatctgc cctccaagat gctgcattcc 

174601 ataggtcacc gctgcctcga ccagcgtatt gcggatagac aagatgctca tacgatttcg 

174661 gattcatctc tccggcaagg gctatcagac ctgctgcaca taatgcccat ttggtcggcc 

174721 atgactggct catcaactcg agtgatcgtc ctaatgatcg gcaccccgcg ctccccatcc 

174781 tcggtccgcg ttccgttcaa gatagtggca tatgaacccg ttactatgat gatgcaacta 

174841 ctacggaggt ataccacgag gattaggctg ctgactatag gaaacaatgt ataggagaat 

174901 ggtattacac aaggacgcga cttctatttt catgccgccg tagtatatat tgggcaccac 

174961 agaccgacct tcttcatgga gcatccagca tcatagatct atttctctgt ctcaacatac 

175021 tagtgaaaca tctcggattc atccagcaag ataacactgt ccatcatgcc tctcgcaagt 

175081 gatgcattcc aggcagatct ctcacgtctc tcaccttccc atcttgaagt ttcacttccc 

175141 gtcaccctct ctctcgccgt cctggcctac cttctcctcg tctccaccct ccgcttccag 

175201 cgcgttcgca tcctctaccg cgactaccca cagtatacaa ctcgtgcatc catgtcccaa 

175261 atgaccgtgg acgatgcatg ggcgatacaa aagaacatct tgcaacttga gttccctacc 

175321 actgcagtga aagccctcca atttgcgctt tttcgggtaa cgattcctcc ttacgtcccc 

175381 tcctcttgac atactgacgc catgtgatct cccatacaga catatggcat cccgaccatc 

175441 tcctccctcc ttctgcatac ctcccaattt tcaaatcctg ccacctcctt caaacgctac 

175501 gcagacacgg gtgccctcat cggtcagttc gtgacatgcc ctcctacctc ctctcgtgca 

175561 cgcaccgcca ttgctcgtac caaattccta cactccggat accgtgccag cggtagaatt 

175621 ctcgagtctg acatgctcta taccctgagc ctattcgcca cggagccgat ccgcttcgtg 

175681 gaacgattcg agtggcgagc aatgacggaa ttggaacgat gtgctattgg aacttactgg 

175741 aagagcctgg gagatgccct ggatatcagt tatgatgagt taccatcggg aaaaagcggg 

175801 tttagtgatg gcctgcattt cttggaggag ctccggcagt ggggtgacca ttatgaagaa 

175861 gagaacatga gacccgacca ccgcaataga ttggttgctg acaagacgat ggacgtcatt 

175921 gtttatggat ttcctaagtg ggtgagaggc gtcggggtca ggtttgccac ctgtgtgatg 

175981 gacaacagat tgcgagaagc gatgatgtga gcttttctgc atagcacatt ctttgtgatg 

176041 gcttctgcaa agagccatgt gctgatagtc tgaaccaggt acgatgcgcc acccgtgcgg 

176101 tatcggagga tttttacggg gctggtagcc ctacggaagt attttctgcg ataccttcac 

176161 cttcctcggc cgatgattct ccgacaggaa gttttcagtg atactcccaa tgaagacggt 

176221 cggtattatg ttcaaatatg gaaaggaatg ccgtactacg tgcagccgac catgtggaac 

176281 cgctggggtc cggcggcgtg gctgacgtgg gccctcggcc ttccacttcc cggcgatgat 

176341 ggcgatacct attacccaag gggtttcgat gtagccgatc taggaccgag gcaatttgaa 

176401 ggaaggggcc ggaagtcagt gggtgaatat gtagatgtct tggaaaagga acacaaagga 

176461 cagtgtccct ttggcgggag gagtgaccag ccggagatat gtttgagtcg ttaggaaatt 

176521 ctagaaagga cgtggcctat gaatgacttt gctagaaggg acatcagctg ggtcgaatac 

176581 aacgcctact ttgcacaagc aaatcattcg atcgtcagcg tgggtctaac cgttgcgaac 

176641 catggttctt tcactgtgac agagagcgct gtcggcgagc cttgacatca gaaaccttcc 

176701 cgccacacct tgggaagccc agaacgtcgt ggacttcggt ttataaaccg gttggatgtc 



176761 ctcgtggcta tctcaattca gcagcaacaa ccattgcccg ccatgtcttc catttcggca 

176821 ccgccaactc ctccgcctga gcgccggctg ccagagggga tggatgcgtg gattctaggg 

176881 gccggcatct cttctctgac tgcagctgtg catctcattc aggaggcgca tgtgccccct 

176941 tctcgtgttc acattctcga aactctcagt caagctggcg gcgggagtat cagtacgggg 

177001 gaccccgtca atggttatga ttgccgcgct gggggaatgc ctccattcaa cgacgtctgc 

177061 atggaagaac ttctttctct ggtcccctcc aagaccaatc caaacctgac cgtcctcgaa 

177121 gaatttcacg agttctggga cactgaagca gtgaaagatc accctcatac tcgcttcctc 

177181 acccgccata aacatggtct tgaacgcatt gacgccaagc gagccagtct gggactgcgc 

177241 gaccgcgtgg acctattcat gctggcctca aagtcggaca agtcgctcgg ccgctcgcgc 

177301 atctgcgacc attttagcaa tagcttcttc aaaagctatt actggatgct cctttccacg 

177361 acgtgagtcg tatcttggta ccgttgattt gccatgcctc actaacaatt cctcctagct 

177421 ttggcataaa acctgttcat agtgcagctg agttccggcg ctaccttcaa cactacatgc 

177481 acgacatcca cgagatccac tgtcgccgga agctagatgg tggccgttac aatcgtcacg 

177541 agtctatcgt ctcaccaatt gctcactttt tgtgctcccg tggcgtggac ttccgttttc 

177601 acaccacagt cactgatatc atcaccaccc cttcaagcag cgaaccgcac cgagtatcgg 

177661 ccatcaaagc catacatgaa aatgagcccg agatgacgat aaatctgggg gagcgggaca 

177721 tcgtgctggt ctcactaggc tcggttatgt ccggctccac aacgggaacc aacacgtccc 

177781 caccgtcctt ggaactaatg gatatcgaga aagacttgga cgagaattgg ctgctgtggc 

177841 tagagctctc caccaaaaac cctatcttcg ggaacgcata caacttctgc acacgcatgg 

177901 cggagtcccg cctcgaatca tttaccgtca ccttttcctc ccccgagttc ttcaaccgct 

177961 ttaccgccct cacaggtgat aaagtaggaa gtgggacttt cgtcaccctc aaagacactc 

178021 cctggctcct gagcatcaat cttccccagc aacctctctt cccggatcag cctgcgcacg 

178081 tgcaggtact ctggggctac gccatgtacc ccgagcggga gggcgactat atcaagaaac 

178141 ccatgctcga atgttccgga caggaaatca tggaggagat attaaaacag ttgaatttcc 

178201 ccgtgcaggg gatcctagac cactcaatca ccctcccatg tgtggttccg cgagcggcgg 

178261 cgacactgtt ggtccgcttg cgcagcgacc gacctccagt gattcccccg ggaatcgaca 

178321 atctcgggct cataggccag ttcgtggaca tcccgggaga ggtggccgtg acgatggatt 

178381 acggggtccg gtcagcgcag acggcggtcc gtcagctaat ggggctggaa aggcacaagg 

178441 tattgtctaa gcgaagttcc gcaatcaatt taaaagcttt gtaggctctt tacgaccggt 

178501 tgaccttgtg gtttgtcgtt gatgcgtgct tcattcagca ttccgcctgg gtttatgata 

178561 catggattct tcgttatact ggctttccat gtcatgatac tctcatccag tgcctctgtc 

178621 aaacactctg ccgattcaac aatctcttcg tgcgatcaca tcgaggattt acatttcaat 

178681 gtgcatctca acaagaccaa aagcacgatt taatgcatgt acattgtttc caaactagcg 

178741 tgttgtgttt atgagttgtt atattaccct ctatcatcga tttagaatga gctttctagc 

178801 aattctactc ataactgtcg tagtctcgaa cagtcgacgg gacctgctct cctcttatat 

178861 aggtgggcag gtgggattga gatcacgaga tggatgaagc aagtgtccgg atatagcact 

178921 tgtcgtcgat tcctgtcata gaggaatgca gaagaatgag taccctgatc tgatggccga 

178981 atattatttt ccagggatgt atctatctgg agttgtgtat caagacatat ttggtgaagg 

179041 gatactagta aaatccatac aagctttcct gcacaccgcc aatacataat ccgagtatac 

179101 ataaaaacca agataaaccg ataccaataa cgatgtttga agtaatatga tgccaatccg 

179161 tgccaataaa aatggtggat ataccaataa ggcgtaccta tcgcgccaat aacggaccca 

179221 gattatatgc actaataccg ctgcaaatat aactccaata gaaaagagag tcaatgttcg 

179281 ccgaccccga tgatgttgat caaatatcag tcgacgtcgg atccattccc atgaatactt 

179341 tatacaatcg tctgctcacg gctggaccaa tgcgtgaatc ggtgagggcg ccgtttttgt 

179401 ttgctgattt ctgaaagtat aatggtcagt accaagacaa agatataatc gaataaggcg 

179461 aggagccgta aaggaatata ccttgacgtc ttctaatagc cccggtccat gtttccgcat 

179521 cgcatttacc aaatccatcg cacttgggta ttgcgtcgca atgccatatg ccattgacgc 

179581 cgtgactcgg ttcacctcct ggagcatctt cacgaatgcg tcggatcggt tctcgccggg 

179641 ctttacttgt cctacatcca tacaaaatgc ggagtcattc aagtccatcc gctcccggcg 

179701 atagggcaca gtggaaatat actcagtgaa gttcttgatc cattccgcgg actctgcagg 

179761 ggagcttgta tgatggatta ggcacgaatg ggtcacctgt aaatcgagga gagcgtcttc 

179821 aaccgtgtcg ctgtctatag gcgtaaccgg ttcaggctgc tgttgctttc tccgcgttga 

179881 gggacgggca gagctgtctt catattgtcg tcgcacagcg gcttggtagg ctctgttttg 

179941 tgcattgtta ttcttccgca ttagacttgt caagccttcg attaaataga tcggtcgaga 

180001 ctttgggtag gctaatttca ggcgctcaac gtggtcgttg atagtatttc ttggttcggt 

180061 tgttggggcc acgatcatat tgataaagtc ttgagccgct atcaaacata ggacatgctc 

180121 ctcttcgtcg cggatgtgaa gcgcgcacgg ttcccagtga ccaagggagt cgttatatgt 

180181 agcccgaacc ttgcgacgcc acttgacgat attcggaatg gtactgggga aaaatgagag 

180241 atcaactttg agaaggcgca taagctccga ggtctgattc ccgagacttg tgtcggtaaa 

180301 tgtagaggcg aggtcaatta gcatctcatg cgtcgaatcc ttcttgtcca ccttgagctt 

180361 gttgacctca gcaatgtccg ctgcgagttg cttttcgcgg gctttatctt ccttggcttt 

180421 ctgcttgcgc tccttttcaa gttgcctgtc gtgttccctt tgtgccttag ccgcctcccg 



180481 ctctctggct ttagcctcct tctcttcgct tgtaaccttg attgctttgc gctggggctt 

180541 ccgcggtggc ggcatctctt caacctctat atcatcgctt tgatcctgga tcctccgacc 

180601 tctcgacgac ctggacgcag attcgccatt gctgccacgg gaacggtctt caagggttgc 

180661 tagtagattc gcagtccgac ttgaaaatgc gtttgcttcg actcgctggg ccacatcaag 

180721 taaatcgtcg acatccggga aatccaattg gtcgctggac acttgaatcg cctctttccg 

180781 gggtcggttg ttactgcggg tgtcactgga attgccagca tcatcatcat cgagtgagat 

180841 agtgtttaca gcacgcgagc ttatggctgg tctgaccggt agattatgag tgggttcagg 

180901 cgccgatgag gtgaagacaa tagggtcgga ttctccattc caggccggag gttttgtagg 

180961 ggcgggaccg tccgaggata agattatgtc gtcatcggag aagaggaaca gcgagttctt 

181021 agctggtgga agcggtacat ctttgacggg tgtgatctca tggctcaagc gccgtcgttt 

181081 ggcgggctgc tcgatggggt tatcgtaggt aaatgcggag aggtcaattt cgtcggagaa 

181141 gatggagtcc gaggcgggta cgacaggcgg gtgagaatgc gctctgcgga tggagagctg 

181201 ctcacgtcgc tctaacgggg gcggaggggt ggaggagagg agatcgatga cctctggcat 

181261 gttgcccaaa acaccaaaag gcccaacctc gtaatggcaa cagcggggat gaagaacaac 

181321 cgccgaactg gagacgacgg cacgtgtcgc ggggaacacg tcggtggcac ggtgtcacgt 

181381 gaagcaattc aatatctact ggatgtgaaa gttgatgaac tgagcatcac gaattgaata 

181441 tagtacacgc aagtcaaggg tattattcta acaatcaatc aaagcagctt cacaaggaat 

181501 gcagcagcga gggcaagcac cactcccacc aatgcaagga aggtaagaag ctgttggaag 

181561 ccatggaacg gccgctgttc ctggccgagc atctcttcga tttcctgctc gagtttctcg 

181621 cccatagctg tcagtgagaa ctccttctcg acacgctctt tcgcggccac agacatcctg 

181681 tccagttcct cctgcttcat gccgtacagc accttgtcca tcaccgcagt ccaatcggca 

181741 tccaccttgg cgtcacggag ccagcccgtt tcgccctcca caatggtctc caaaggccct 

181801 ccagtattcg acgcaagcac cggcacgcca gcccgcatcg cctcaaccgg tacgatcccg 

181861 aagtgttcgt taatcggcgt atacaggagc aacttcgcct gcagcaaaag ggtatcccgg 

181921 aacgccgtcg gcacggacag caggaagagc acatcgatgg aatcgggtat cgaaagggcc 

181981 gaaataaccg tcttagatgt cgccgtctgc aagcccaacc cagtcgccaa ttcatccagc 

182041 tcccgatgat actgcacatt ctcctggacg cgattatcgt aaccacctgc atcaagcgaa 

182101 caaccagtta gcacactcct ctcaaccagc aaacaacgag cgcagcctcg acggccagac 

182161 aaaagggaat acccaccagc aacaaccaat ctagtcccct tcctcttctc ctccccaaga 

182221 ccatgatagg cacgaatcgc caacgccagg tccttcttcc tctcaaaccg attgacactg 

182281 agtaggatct tcttcccacc ccacaacttc tccccatcct tcaccacccc ctcaccccca 

182341 tccccagcct tcgtatcaac acacggataa acgacactca aatctccaac cttctccctc 

182401 ccaaacaccc cactaacaac ccctctcgta aacctcgaat tcgcaaccac cttatccgac 

182461 gcactcacag cccacccctc aaaccaatca aacggcacgc gatacaaccc cttaagcagc 

182521 tgcagtaccg aacccccctc atcccgcctg gcaagcaact gatccggaaa atgacaatag 

182581 aacaaaatcc tctgcttcct gctgttcttc ttcccaccaa acaccttcaa aaagggcaca 

182641 caagcaggca tctgatccac aatgaacacc tcctcctccc cctccccatt atccccactc 

182701 gtagtctccc cacacaacaa atcccacgtc aaatgcaatt gcctcaggat agccatgagc 

182761 acaaacaacc gtccgcccac atgcgccggg aagatcgtat tcccgcgcac ctgcacatca 

182821 agtgtcccat cgcgcgcctc ctcaaaacaa tgcgatttat cccggtgcga ggtgtagatt 

182881 gtgacggggt ggcctcgcga ttgcagtgct agggccacgt cgatgattag acgttctgcg 

182941 ccgccgatgc cgagatcggg gtggatgatg gtaattttgg cggggggcat tgttcccttt 

183001 tctttgtttt tttttttcgt tgggaatttg tgggtgagag tgtaactgag tcggagaatt 

183061 tcgtttaggt ggatattggc attgcggatt gagaaattgg gttaattgtg agtgtttttg 

183121 tacagctgtg actttgtagt tccaattagt tagctgcggt ggtagcagct tgaaagagac 

183181 acgcggccga ttgggagatt gcgtcatgtg atgatctatc tcttatcgct agtcaagacc 

183241 ggggtatgta tatttgacct ctataactat tgacagggca ctatgatcat taaattatat 

183301 tagtctggat aatacttgtt actttattga atatcacgct aattttaaat tgttctaaat 

183361 cgagccaccg tctaagctgg tgaaatgaag aaagtgatat caggcaatca atcaatcaat 

183421 atacgtcccg gcaaataaca acttcacacc gacgacaatt ctaactagaa aaagagcaag 

183481 gcagcgacaa tggctgctca tgcgggaaac ggatgggaat aaaaacattc tcagagaaca 

183541 catgtatcaa tgtatggtca gggtagtcgg aaaatgtaca atcacaggcc atacttgtaa 

183601 gtcataaagt caggattcat caaataggcg gagcagaagt caagggggtc aggtgcacga 

183661 gggaaggaag aaaataaaaa agaggtgaaa ggaaatacac agaaggtaaa taagcggtgg 

183721 tatcaacgag gcgaagaaag ggggaatcga cgttgagtgg gttcataata caagagaaga 

183781 ctgaacatcg tggcgccaaa aggggcttga aaacaaaagg gggcagggca agacaaggcg 

183841 aatgacatcg gctaatgtag atcgatcaag tctaaaagca gttgagagtg ctcttctttt 

183901 tcttcttatt gagcgcattc atccgctcct cgagctgcgc ccgggcggct tcggcctccc 

183961 ggcgagcgtt tcgttcctct tcgacggcct gcagtgagag gcgctgggag gcgcgctcct 

184021 tgcggagcgt ggtcagttct tcttccagct cccgggcctt cttcttccac gcgtcgcagt 

184081 ctgacttgac cttgttgctt tgcgcttcaa ggtctcccag ggcttcttcc aaggtggcag 

184141 tgagctgctt ctctgcgttc aggtgcttct cgatggtgcg aatacgggcc tcctgatcat 



184201 cgacaatctg tgggttcacc agctccttac tagtgtgacg ggagctggga ggagagatac 

184261 tgcccgtacc gttggtggcc gaggcaatag tcggtagagg gggcagggga atagcgggag 

184321 gtggcgaagg gagagaggtt gcagatgagc tcttacggag attggacgta atcgagctgc 

184381 tgcggtcatg agagctgtct tggttcttga tctggagcta gaagaacgtt agtcgtcgcc 

184441 agcatgtagt cttcggtata taaacactta cctcgagctg ctcgatccgc acgcggtatg 

184501 tctcaatatc cttctcatag gctgcaatct tcaccttggc atcttccaat gcctggtcac 

184561 gctcctgctg aatgacggag agacggttat ggtgctgatc gaagctattg gtgatctgtt 

184621 cctctaactc ctccaccaag cggacactct tctccttttc agccttgact tccgcaatga 

184681 tggcctcctt cttcttgacc tgctcttcct tttggagcag agtatcttcg agggtggcaa 

184741 ccagaagagc cagctcgttg accttcgctt ccttgctaga gttgctctgc atgagctcct 

184801 ccagccggcc ttcatcggat cccttggtct caagctgttt acgcaaatcc tcgttggtat 

184861 ccagcatctc ggcatacttg tcggagaagt gctgcttctg agccatgatt tcctgtattt 

184921 gctccgccat ggtctccgcg gtaacgtcgg cgttaaggag tttagaaacc atattgagca 

184981 atttcgtcaa ctcttcccgc gactctagga tctccgatct caaagtcttc atcgcctcct 

185041 catgctcggc gctcatcgtg gcgaaacgcg actcatagtc accctgcgca gccgccgtag 

185101 cccgagcttc aagttccgtc acctcctgct tgtgtgactc ttgtagggac tccaggtcct 

185161 tcttgaacga gtcagcaagg gacttgtggg agtcgatctc cttctccaac tctgacacac 

185221 gagactgatg tacttgaacc tcgttgctat cgttgtcctt tgctgccaga agttctgaca 

185281 gctgcagctg ggcttcgccg tgagactttt cgagggaggc aatcttctca aggtgggcag 

185341 cattggtggc cttatgttcg tcaataacat gcttcagttt tgcaacgagt tcttcttgtt 

185401 gctccctctc ggcctccagc tctctagcag cagcatcctt tgaggcagag gcttcaccgg 

185461 cagccgtctg ttcagaacgc attgcgtcaa tagacgccaa tgccttgtcg agctctgcca 

185521 aagtcgcact cagttcctgt tcggacgatt gcagggattc aacggcattg tggtgtttgc 

185581 tcttcagttc cgtgacctct ccttcgagag aagaaatctt tctctcttgc tcttcagcgc 

185641 acagatcctg ggctttagcc gcttcttcct gggtcttcag gaggctcttc aactcttcga 

185701 tctcggcgag aagctgcttc tctctggcgt cgtgtgattc tttcatctcg ttgatgagat 

185761 tctcttgctg caccacttga tcgcgcagtt gattgaccag gcccatgtcg accgaggacc 

185821 caccgcctcc ctgcaagctc gacctctcac gagtgagtcc agagatgatg gttgattttg 

185881 tctccatttc cttcttgacg ctttgatttt cggcctccag cgcctggata cgttccatgc 

185941 gattgtgtag ttcgtgcata gccgagtcga ggtggacttc gaagttctgc atggtatcag 

186001 ggcgggcctc aaattcctca actgcaatat tacgcaaggc ggcaagggag cggtgagccc 

186061 gatcgaccgt accaatcaga ctctgacttg tcttcctccg gattacaggg gacttgaagc 

186121 ctggggtgct cggcggtgaa gaattcttcg atcgctgcag ttcggctttc agtgtttcga 

186181 tgaggctgag ggttccttca tgctccgact gcaactgggc atacttctgc gacatgatgg 

186241 aaacaccctc ctgatgctcc atgagcaaac gacgcatctg ctccacttct tgagccttcg 

186301 cttcatctat ctctcgagtg tcgacgtggt tctcctgcgg gacctcgcca ttactcaagg 

186361 tagattgctc gtgcgaagaa gcaggctccc cggccaagga taactccgaa gagagtgatg 

186421 gcgaggacga cggttgtttt ccatcgttcg agccgggggc cttgccttcc tctaaaaaag 

186481 acaccgggcg agattcgacg ttggtggaga gaaggtcctg aaccttttcg tctgggtgac 

186541 gggcttcgtc ggcagcgtcc cgttgaaact ccgccagggc tttcatcgct tccgcatact 

186601 tcgcctcgag catttcatac tcactcacag tagactcgtg ctgcatacgc aggtcgaaca 

186661 actcctgagt gaccgtctca agtttgtccg caacgacctt ggtttgggtg ggagtagaag 

186721 ctcgactcgc acgggcaccg ttctcctgag cctccaggcg gtcagtagcc gattctagga 

186781 ccttcaaatt ggtctcgtct gtctcagcag tcgcttcagc ctccacggaa gcttcgtgca 

186841 cggcttctgg ggctggctct gtcaagcctt catccgattc gggtatggcg gtctccacag 

186901 cctcagtcag gacatcgagg gaaagggatg ttgcccgggt gcgaggtttg taggcgccct 

186961 tggcgggaac tgggacatga gtctcaagct cgtcctcgga ctgctccttc tggttgccgt 

187021 tctgctggac atgatcaagc tcatacaaaa ggttatccag tttgcccagg ctgcgctggc 

187081 gctcaataac atgctccaag cgctccattt ccctttgcat cagctccata tcttgatcag 

187141 cttcggccag acgctcctcc agcgtggaaa tatagtcttc gcagttggct tcgctttcgc 

187201 gggctcgact cagctccttc cgaagttccg ccacgatagc cgcctgcttc tcttcaccag 

187261 tagactggcc atcaagccgc gcttccagct catgcagata ggtttctgtg ctgctttcac 

187321 ggtcaagcag tttctgaatc cgcgcctgta attgggagtt gacagtctca acgtatgcgc 

187381 actttgtttc gcgttcgaga agcgtgctct ccgtcactga gagcgaggac ctagtgttag 

187441 aaagagacga ctcaaggctc tgaatagtct tctcgtactc caggacgacc tgctcgatcg 

187501 actccgcaaa cgattgtgat cgcttcagac gttcaacaga agttttcccg accaagctga 

187561 ccacatcatt ggggtctgtc tcgccagcat gttcggagtc cctagcgatc tccgagatca 

187621 atttcgcatg acgctggctg agagcgttgt aactctcttg ggtatcgagg agctgattct 

187681 ggagctcgac ctccctctcg ttctgtcgct ccgctcgatc ctgtggggca gcgcttcgac 

187741 ggccgttctc ttccgcatta cggagctgct ggcgcaagaa agcgacctcc gcctttagtc 

187801 gttcgataac tgcgtgcttg tcgctttcgt cggcaatctg ctggatgcgg ggcttgctct 

187861 ggatcgctct ggctctttga gcatactgaa ccgtattcac ggtttcactc agatggaact 



187921 ccgcagggtt gacacatgcg atcatgtatg tataggcgtt accacccaga gagtcctgga 

187981 gcagtcgtgt gagcttcgag tcacgatagg agacgtgagc gccggcctgg cgagaggata 

188041 gctgcgaaat gaccttgccc aaagcagcaa ggcccgcatt gatagagata ccctcctttg 

188101 cccgttcacc agaggccccg gtgttcttca acctctcact tcctgctaaa tccacgaaat 

188161 ggagcttgct atccaccatg acagaggcat cagagccaga catcatatcc actggcatcg 

188221 acatgcgctt gtctttgggt gtcggggaca tgactccgtt ggatgccttc cgctgaacaa 

188281 ggttcagact gaaaaccgcg tgcgatcgag aagacttggc gttaatcgcg gtggaatcgg 

188341 tctgccggat ggaggaaccg aagctaagag cacccatcag atcttcgtat gagttaatat 

188401 tcacctggtg tagacccgtg agaataatac gtcctttggc atcctcacgg atggtgaccg 

188461 tgccccggtc gccctgatgt gtcgattcag gcacaaggag atctcgcaat tgttcgttgt 

188521 agatctgcac acgtcagtat cttcaaatcg cgaagcaaca ttcgatgtgg ggtcacaaac 

188581 ttcaacgtag gtagccttca gctgccagtt cttctcgacg ctagctttcc caaagctgga 

188641 ggcggagccg acagaatagc gggatggagt acgaaggcct gtgccgctgt tgcgattcgg 

188701 tttggcaggg ccctctagtt tttcaaagag aagctgagcc gcccgaggaa tgattcctat 

188761 tttaggtcag tcgaatgcgc cgcagagcgc gcgattctcg gggaggacga acccatcgat 

188821 ctcgggtcac tttgttcact gggacccgaa gttcccatgg tgtaggactt tcccgcgccg 

188881 gactggccat aggccaggat ggagacattg tatccctgta gaaacgagct cacactatcg 

188941 ctgaggtagt cccagacgcc atcctgatcg accgtctcag cgaacacacg gtcaaaaacg 

189001 aacaatttcc ggccctgtgg gacatcgatg gcaacactgg tgggattggt gacgtggacc 

189061 atgggacgct ggaagcgctg gggaaccaat tcatatccgg gatcggttgg cttcaggggc 

189121 ggtcgcactc gaacagctag aaaagtaatc agtatgcaga gctatactcg tgaagccgcg 

189181 cacgtggggc gtttgaggcg ccgaggcgag cggtatccgc aataccgcat ctgaaccagc 

189241 tcagccccaa cgggcgcacg cactgaatgg aataacgcac cgactttgac cgcagttttc 

189301 ccatcctcat ccgatgcccg acttcctcca ccgtgtctac tgctcatact caagcgactg 

189361 gtgcttcggg gggtccgcat gatgccgctg aaaggacgct gaggcgcatc cgctggcgat 

189421 tcaggctgag agcccgtcat ggcactttat acgattgcag gccaggagga gggggggagg 

189481 aacgggttgg gcagagatag atcggcgaat ggtggacaga gatatgttga ttggtcatga 

189541 aacggagacg ggagtcatca gtcacgaatc ggtgcgccaa aagtgcgaat gttcactctc 

189601 ggtgtccttt gattaggatt attatttatg tgggcggttc cgtcgcgtga gatggcggcc 

189661 tgcggcactc tgcgacgtca agcggtctct aacctgcagg tacaaataaa aagcgggaag 

189721 gttcctatcg aaagtgtgaa agctggttag aatgggtgaa cgggaaaggc gggagatgga 

189781 ccggaacaag tgaattaagg taaggggaaa ggggacataa ttacaggaga gtgtcccgac 

189841 cacagtgcga acaaagggag ggagcaaaca gtttggacac ttacagagag gccgacagag 

189901 aaagagagag agcgaagctg caccaccggc ggtcaccgca gaggggctta agggcaatga 

189961 cagaagggag tgtgacaggg aatggacagg aagagttccc cgggtctagg cagtaccagc 

190021 caggtagaaa ggtatctcta cgcacccaga ctagaaatcc gagaaaaaac acccaggtgg 

190081 gagaagctgt cgcggagagg aggagatgga agaactcctg agcggcgagc aaagttaatt 

190141 ggtggcatca aggttgaagc gaaccgggat ggggctgggc tggggcgttg gcccaagtcg 

190201 gccgctttac tatgttgagc tggactgaag tggaaacaag tggacggtgc gaagtggaaa 

190261 aggagggccg gccagcaacg gcgctgcggt gaatattaga ctaatactat taatgaatgg 

190321 agagagagag agagaaagag agagagagga gagagccgat gggaggggtt caattgatgg 

190381 cacgcggagt tacctccaat tcctgtcggc tctctctctt tctctctctc ggtctcttaa 

190441 tacctgaaat ctactttcta atctgcttga aagttgaaac aggtcataaa aagcagccag 

190501 gtaaaaatag taccgcagct gagaccgtgg ttcctgacca tcttccattg acgatccagc 

190561 aatcaagggc cagactgcat gacacccaca gccaaagctg aagaaaacaa cagattgtcc 

190621 gtcaggcacc gtccatgcaa ttgggattaa ccctcaggaa ccaagtcccg tacgccttcc 

190681 ggtatggccg ttacgacgcc aggggcagaa acccactccc aaaacccacc caccagccca 

190741 gctgtcgggg tccagaacac acctcatgat ccctacttgg tggctatcta tcgtcgcgct 

190801 tggtccctgg atcgcaattt gtggcttgtc cagatcatac actgatgcga tcggtggtga 

190861 cctgatggtc ggcaaaagaa aacatacata tattccacat catccccatc cgttcgccgg 

190921 taataaaaaa gaagcattac gactcacgag cggttcccgt attgccgaca gaagttgttc 

190981 cacggttcgc ggcggatgga ttgcaattca ggaaccgaaa cggagtccgg agtgccacct 

191041 ttcccaacca catgggggta aggaactcca ttcttgtcat ttttagacgt ttcagattgg 

191101 aaagctgctg gagattcgcc agctgaggac acgacaccag caccagcatc acttttttgc 

191161 catccaccac ttgcttcttt acattggttg aacaacagca acaaatctag taactagtag 

191221 atagccatgc agccggatcg tgaaaattgt catccacagg gcagggaaaa agaaagcagg 

191281 gaagccatac caaaatagca gaaacccccg agaaaaagaa acgataagga aaagcactcc 

191341 agagagcaaa gtgcgaaccc aggcgaccag gatgaaaact taactttaac ccaccatctc 

191401 cttgacatca catgcctttc aagcggaaag agacattata gggtgaacgg gggggatgtc 

191461 tctccctccc ccattccacg tccaccgctt gacaccagtg tcaagtgttt tagtttagtg 

191521 attaaactac tggagaccgt caaatgtgac cggcaaactc aggaaccatc tgcatcggga 

191581 cgacgcatcg catgcgcatt tccctcccca agtcggtaag gtcagtcaag gcaagcgaaa 



191641 tgacgaccta ccagcccagc ctcccaagcc tgcacggtcg gatagggaca tggccgtcca 

191701 gtgcgtaccc tttagtcaga gccagtctgc tcgaaatgga gaagcttgac ccgcgggtgt 

191761 tacagtctgt tgcttgagtg cgggtcaaca tggtgccatg gagggcgtgt gcgacaatcc 

191821 attcccattt aaagagtgga ttgattgctg tccgagtgga cactagaccg cgacttagcc 

191881 agttagcctc caccggcaat ggattaaagg ctatgctcca tcactaacgg tcgtgcgatc 

191941 gcggtggaag aggaagagat tggcatctgt ctttagttag caggctactg ctttatgagt 

192001 gataaaatga tagcgcctgg catccttcat ctcggtcggg atacgtaagt tcccccgacc 

192061 gcctatcagc atcactctac gaaattaatc cggttcccca atattaggca cttcagtatc 

192121 cccgaccgga acgtgggagc tggccccaag catggatcat ggatcatgga cgcctttgag 

192181 ttccctggat gatgtatcaa ccaatcaccc aactactaag ctcacacctg gcatacttca 

192241 agtctgtcgg tcattggacg acatatgcag atgaaaaaga aactaagtat tataaataaa 

192301 ggaacattag aaataaagga acattagaaa tcaacctgcg agattattcc attttcgcag 

192361 actagtagct agcccatggc aattgtcgct tttgtcaact actcggtagg gtatagttca 

192421 ttctacttcg ggccagagat gcattgagac tgtttcaaag atgaagctga gaaggcttgg 

192481 gtgacataag gcaatcttgt acaaccatct gccaagctgg gtgcgctaat ccagatcagg 

192541 cacgagaggc tgtatacctt gtcgaaaacc tattcagatg acttcttgcc tttactgtgt 

192601 ccatgacccg accctctagt tgtgatgcaa tgaatgcata cagttcattc agagtcggac 

192661 ttctaatcca tggaagcgcc caacccttaa gtgtgcacat ctctcatctt caaacagctc 

192721 atcggacgtt gacaaagtcg gggctcgcaa gctatggcag actcaggaag agcaatactt 

192781 gggtatcaca tactacattc attgcatgca aattccgcac tagtccgaca atatatatcc 

192841 tatagacgag gtcctactcc caatgacgta caaatcccag tgaaccaaca ccactcctac 

192901 ttcaaatcgt tccaatttct gcatagtgat tcacgtaacc tccatactcg ttatgcactt 

192961 ctgtaaccag ctataaaacc tcataatccc tatatcatca tagcagcata atctacccgc 

193021 cttgttgtca agaacagaga agtggatgct actatatctc gagccaagcc gattgaccga 

193081 tctatcaatc aataacacca taacccacaa aataggattc aacggctcat tgattgccaa 

193141 tgagccaggg gaaagggata tccgaccgaa cgttattctt ctctgagata gatactgcat 

193201 aaatttctgg aggtacctca atttttcttc acctgccgcg agcacgttat tctatgaaaa 

193261 ctggacccga tgggaagaaa cttgtttaag acaattctct tctgaaagta acccatgttt 

193321 ataattagta gtaggcctga gcacgggcaa ctactcctac ccaatatata taaaatgggg 

193381 gtcagtccat actatgtaga actactacta atcggtatcc ccacataacc acccacatta 

193441 acattataca ctcaacatta ttccacagaa ccgaacccat ccatcaagac acaaagtaat 

193501 aacctaaata ataaatcaaa taaaaccctc aatagcgctc ccaacccgcc tatcccaatg 

193561 atactcttct attcctcatc atatgattgg gtggcccctg aaatttccaa cggccgatct 

193621 tcacccggca atcggcacgt atatctccag cagagtaacg atcgtcgaaa gtaccaccat 

193681 tttgtccact cttcactcct ccaccgaccc aagtcttttc tattgctttt gtattgttca 

193741 tccacggcaa ccatgccaat attcatttac gctattattc acctctattt taacattgag 

193801 gctcatagga aggaaggaag ggatgataaa tgtattatat ccaggtctag atatctcccg 

193861 ctcatcacaa aacgccacaa ctgcctatcc gtatcaaacc cccaaactca aaactgaacc 

193921 tgcataatca ccaatgcacg attgtgaact tgtcacaccc gcacccccat tccacattcc 

193981 accatcttga atcctatctt gacagtaact actccgtact aacagcgcaa gcagctcaga 

194041 accaaagaca atgaactcaa tttccactga tcaaccttgt cacttcacat atagagtagc 

194101 aggctcgtat cgagatacac ttcctattac gcatcattgt atggcattac ggggggctct 

194161 acaataaatc gaattctttg tttaatggca tagcaagtag aaagcttctt cctttcaatc 

194221 atgaaatatt tccgtagatt gtacctgcgt ttcaatcaca ggagaggaaa aagtgataca 

194281 tggccgatgg ccgaaggaat gaaagcaagg atgcagatac ataagagggt accttactta 

194341 agtaataagg tttatgcaag taaggggtcg cgttaagtta aagttagtca acgctcaggc 

194401 agatgacagg acatatttat ggatagatag atggatggaa tgataggata tatcaagtag 

194461 cgtacctgtt gcaaggtaga tagatctatt ctggaggcaa ctaactttgt acttatatgc 

194521 acgacttcgg agggacgagg agataaagca tgattttaaa atagatctat cggagaatac 

194581 gacagacgcc aacgcaacaa ctttgtaagc cacatgggtc actggggagg caactaacta 

194641 gtcgggtgac taactagaag cctgtgcaat aatcgatccc tggcgaatga caatccacaa 

194701 tggggtagag atagatcgca tcttatctag atcagtctca tcgtttcata agtggcgttg 

194761 ccttatgcaa ggggaccgct agatcgagat cgagatcgaa cggtggatga caacgcaggt 

194821 ggataccgag ataagacact gtaagtacgc tttggcattt gcaactagaa ctcgcactat 

194881 tcaataattc tgataaaagg tatgcgctat gacatttttg attcgtgttc tcaataacac 

194941 cctccaacaa tcttcatgtg cttccttgtt aaccttccat tatggcgtcc cggacttgat 

195001 cttccactgg ggcctcgggt gcgttagcaa gcacccatgg tagctcaata tccgtaggcg 

195061 atatgcagac agggtagcct ttgatttgat cttgtgtata cagaagaccg ttcacacaca 

195121 gacatccaat gaactgccag agtctatgac acatattgta ggtagaaccc cgtggtattc 

195181 tcagctgcat atcgttgagt accaggcgta tatgttgata ataacctagc gagttcatgt 

195241 tgcgcccatt tagaccggta atgttatcca ccgtatctgt gaattgccgg agtaggaagt 

195301 tcggtatccg tcgcggtttc tgcgttgcta gggttcgaat aaaagtgtga agagcgacat 



195361 ccgctttctc aataagctga cgctgagttc caacgtgacg aatcaatgaa gcgtatgcac 

195421 tgggagtgag tatgtgctca gcgttgtgga actcgttccg cccccagtat tgcgccattc 

195481 cgaccaaatg gatgagacat tcgcggtaga gctgggcaca ctgaatgtct tgcgccattt 

195541 tagtataggc gagaatatcg tcagtaaggc caaggagggg cttagaggta agagtccagg 

195601 gaagtataat cttctcgata tattcacgca ctcggggtac tgcctggaag cgtattccga 

195661 gcaacaccat ccttcgccat cctttgatac caacgtatgg tacaggcttg agatataacg 

195721 cttgcaggaa gcaatggaag gactcaatgg cgctcgggat acgttcgtct ggtatttcgg 

195781 gataatcgcg gtaatatctc tgcggagtta atgccacctt gtcagcgtac attggactcg 

195841 agtatataga gcgactcagc ataccgctcc ctttcggtag atcccccaac caccatcttt 

195901 ctcagttctc gcctcgaaat agtgtacgcg acgctcggca tactcgccgc gctgccgttc 

195961 cttccagatt ccagtgtact ccttgattag acttgaatgc gttcgaaact cctgtccaaa 

196021 gatgtcgatg tgagtgtcgg tggggccttg ctcatgcggg ccgcagagaa cagggacgct 

196081 gattttctgt tctctcagtg acttgaatgg gttatcattg aatatctggg ctagatctgg 

196141 gacaatatcg tctgcaactg cagggctcag gcaagggatc tccataagcg actctgtgag 

196201 gtaaagaacg gaaggtgtag tctataggga ctgtaaagat agtaatcatg tatgacagaa 

196261 tgaggtgtgc ttaatctggt agatgtcagc agtttcaagg ataggcagcc aggcattctg 

196321 ggactttgca gaaccatgga agcatagact ggacagggct gatgtttatt gcctattcaa 

196381 gcaggcggcg cccttaagct ttcctctcgc atctgtcatg atatcaaatc tggaagtatg 

196441 cttgtctcgc atacgtcagt gcggttagta ccaaacattg tccttctaat agtgacttcg 

196501 atgcaggtaa tttctgctac ttgtggaaac gattctcgcc tgttttgtac tgggagtttc 

196561 ttccccgaat gttaatgata tacctggcca tcggatgagc taaagatatc atataggtgg 

196621 ccgaaccaac ggctccgatg cgcagatcct tctgtatgtg ataaattgac cttggcatcc 

196681 cggccgatgg atcagccatc ggactttatc tgctgttgaa ttacatcacg ttttggctga 

196741 tagtcaaact gcttcatttc atgattcgag ttgattcttt ttccggcgat aaccttcggg 

196801 tcgattcttg tttattttga ttgacggtat cgcccaatat ctcgtccggt ctggagtatg 

196861 atgctgaaac ccttcgaatg aacgcggtca gttacgcaca agcacgtgtt ttggcctaga 

196921 aaattcacac gtgattatgt ctggcctatt ggctttatag atcaatcttt ttttcattgc 

196981 ggggataatt gggtcaagta tttagcgatc atgccataca tggataaaga tatgtggcaa 

197041 taccattata cagtcttcca ggaggtatca catttacgga taattgttat tattccactc 

197101 aaattctacg tctggacatt taaactcctc tgacaacacc tctacctccg tgtgaaaagc 

197161 tgtaacgcaa tgccttcaag cgccagattc tgaagtcatg gataaaataa ccgctgaagc 

197221 ccgtgaccag tcaattcccc aatgcagcgt ccatcaacta cagctcatca gagcgaccaa 

197281 cagacagcgg acccttcttg accatgatgg taagaacctc atagagaacg tctgcagcgg 

197341 ccctgttatg atgaatcatc agtaaacaag tcatagggta cttgatgcag gtttgccata 

197401 catagtagaa agctcagcat ttgtatcata agcaggtgca acttcaacga tgtccgcacc 

197461 gatgaagttc aatccatcca gtccacgaat gatagtgcgg agctcacgcg tcgaccaacc 

197521 tcccgtttcg ggcgtcccgg ttgccggtgc atctacatat tgttgtcagg aaagccatgg 

197581 gaatgcattg ttggagatga cctacaagca ggatcaaggg tatcgatatc aatggagagg 

197641 taaacagggt tctcggtacc aaccctctcc cttatcttct taatgatgcc atccgttcct 

197701 atcaaagtgc atgttagtgg agtacataaa aggtgactgc acagatgtac caatagtatc 

197761 aatctctcta gcttcaacga tttcgaaccc acagtatccg tcgttctcgt aatcggaggg 

197821 accggagagc gtagtgcgaa taccggcgtg aatgttggta tcattcttca gcaggccttc 

197881 catcgcggcg tggtagaagt aggtgccgtg gttaatcgcg gcgacctgac taggtgagcc 

197941 gccgaatacc tttggcttcc aggaatccct agggcaaagt aagcattggt ttctccggtc 

198001 caggagtcaa acatacaggt ggctgtcaaa gtggataaca gtaacaggtc cgtaggccct 

198061 atttatactg cgcagcaacg gcaaggtaat ggtatggtct cctcccaggg tgatgatgcg 

198121 tggaagggtc tttccagcct tggagagtcc gtacttctca gcatcggtga atggtttgcg 

198181 cattagaaca ctattgtgac cctcttcgat ctgttggata gcccaggcgt tgtcataact 

198241 attcgcatca gaattttgct atcccacccg tggtgtggga accatacgac gttacaggga 

198301 tatctccgca atccaggact ctcagatcac tgacaaaagg gttcgcttgg agaggcacat 

198361 tatacccgcc actgagaaga attacggatg agcatcaacg ccgtcaggga tattcatggc 

198421 agggtatagc gtactagagg ttgaggcggc gagatccttg tcggattccg ctgggtccga 

198481 atcttgctcc gggtctatac gatgttccag tgtcaaaggg agcacctgtt tagttagtat 

198541 gtatgattat tgtcttgttg taatactgga tgaagagggg gggagagagc agcataccga 

198601 tgaatgcaat gtcatatctc tcggcttcgc tggagagaca aggaaagtaa ggcagccggc 

198661 caaaggtaga gatgccagag aagacagagt cggcctggct tgtcagccaa tagataagta 

198721 tgagctcaat acgaaatcac tgacctgagt gccaccatct ccggggagtg tattgtacca 

198781 caacctttgg tgcggtccag ccagaggagt ctggtcatga ctgctgtggg cagttacgat 

198841 acccgagaga gcaagtaggc tcacaactga aagatacatg ttggaaagat acctgggaga 

198901 aaaacgtagt atcaatttga ttgaagaaaa tgatgatcca ggcaggcttg cagggagggg 

198961 ataattatat gcttccatct gcgtctccgc tagttggctt ccccaaccat ctctccttcc 

199021 agtccccgca tgccctttgg tttcaacgac ccttggttac accgaccttt acagtacgca 



199081 cgttgaagca gagtatggat ggacagaatg gcaaactcca cggtatgatc cctattgatg 

199141 acctgtctgt ccgaagtcta tactcgagta ttggatatct gactatgtcc actcacagta 

199201 ctgaagatta cagtctacca attccttctc ccaggacctt ccagaatgag tgaataactc 

199261 gagaacctgg gaaaaccgag tggggtacga catcatcaga tgtgcatcat ctcttgaggt 

199321 gttgtagttg tggggttgtg ggtatcatgc gaagtcgaga tctgttccat gggctaacgt 

199381 cagatgcttg gccagtagta taaattcgaa acgcatgaac gatcaaaaca ggaaggccgg 

199441 aggagaagaa agtggatgct tcagtgcatc aagatcttac actgaagcct ggcactagtc 

199501 tcatagtata attctaatct accggggctt atcgtgctgg tgttccacgg agatagcggc 

199561 cggcatttct tatcagcaac gctgatcttc attctgtgac gtggggacgg ccgaattcca 

199621 gaacactact ccgtaaattg acaaccaaga taattcgtac gataagcgtc cgataccgtt 

199681 ctcgcttaag aattgaacaa agagctgaca acgctaaggc tacccattca aactctctcg 

199741 accttccttt tggagatact agatagagac cctgcgtgga ctttgcatgc ttcatacttg 

199801 atctagttta tacctatctg cattgcatat tacacaacag tcgtggcacc attggtggtg 

199861 tgtccaacct ccaatcggtc atggatatcc attcgctgaa acgccttatc gacctcgcct 

199921 tcccagttac tgaaggccac gattgtccac tccagtgccc ccacaagata caatgcgatg 

199981 cactgtccca cccaaagacc tttcagtccc catccattga aggctagcca aatacctaga 

200041 ggtagtgcac cacaataata actggccagg ttgactagcg cgccgacatg ctgtcttccc 

200101 atacccctaa ggcttcctcc acagctaccg ttcaatccat cggcgatctg gaacaacgcc 

200161 acccatggta gaacctccgc agtgagctgt acgacccctt catcggagtt gaatatcttt 

200221 gcaaagacgt gtcgagtccc catcagcaca gccagcacgg cccctcccag cgccatactt 

200281 agccaggccg cggtgttggc cgctctcgat gcaccagctg catcccggga gcctagcagg 

200341 ctccccaccc gtgaagatgt agcgactccc acgccaaatg ggattgtgtt gagcacttgg 

200401 tctgcggtca tgatgacgct ctgagccgcc agtgggattg ttcccagcct tccggctgca 

200461 agagctacta tctcaaaggc ccaccactcg gtgcctacat gcaccacgcc cagaaaggcc 

200521 aatcgggcga aagtcccaag attctcgaat gctttgcgcg accagccgcc ccagcattcg 

200581 gagccggcaa tgaatcgggc atacaacacg agcagtgcaa aggacagcca gtaggaaatg 

200641 ccggtggcaa atggcgcacc aagcaacccc atgcgaaagg tgtaacagaa caggtaattg 

200701 agcaatgcat taaacggcac cgtaatcagc aagacatacg tcccagggcg cataatacct 

200761 gtatcctctc attaacaaac ttcccagaga tatgatgtct gagaagcacg tcggaaaaag 

200821 acttaccctg tgcttgcaga tattttttca tggcctcgaa gtagatataa cccagaccgc 

200881 cagggatcaa gcaagttagg aaacgcgcac tatcccgtga aagctggggg tcctggccaa 

200941 gcagaagaaa cacaggctcc gagcatgtcc aaagaattgc aacggggaca tagaacaggc 

201001 cgaggacaaa gaatgcccgc tgcaacagaa ttccgagatc atgcttattc gagctgcctg 

201061 tgaacgttga ggaggctagt gtgtccaatg cggtggtccc gcctaaggcg atcatccatg 

201121 ccgtgaccat ggcaaacatc agtgagaaag ctgtggtggc caggttctcc ggagaggtcc 

201181 gtcccacaat caacaccgag gtggtttgta ggctgttttg cagcgtgtat gccagaatga 

201241 ctggtatgga gtccttcaac agccggaaaa attcgaccag aaaggaaacc gacgattttg 

201301 cgcttttgga gtcggggtcg gggtcgacct gcttgggaag tagtggcgta tgttcctctt 

201361 catgagtgtg cgcggaagag gtctcccggt agctaggaag agagtcgtac attatgcaat 

201421 gatagtaggc acccagggct cagtgtttag aagaacgaag tgttgtttac ctcggggaca 

201481 acatggacta cgatgctatt aaagccttca agtttggaga acaacatagc tgaacgcatt 

201541 caaaagcaca aaacagcccc ttacctcgta gaagcagtct gttactcaaa tggcaaccgc 

201601 gcagaccagc aaatgatcca ccacgagctt atccagatcg ggatcactga cccaccagga 

201661 gaacttagtc gagctgcatg tcagcgcaga atgatccaga ggcggggctg cctctttctc 

201721 aagatattca cctctttggg acgagcgtgg gggactggac aggaaatgcc atccaccctt 

201781 gcatgatccg atatcgaccc gagatgtcga gactgacatc atggtgtcaa ttcgaagact 

201841 gtctagcctt ttgatgccca atattacaga ttaagttgtc tacttcttac gcaatattag 

201901 tcccttatct gggatgcaaa tcaggggcta tcaagcatgt gcagcatctg atactgagtt 

201961 gtactcggat ctatttgagt cccgccgatg tatggttccc cagacggggt ctgactggct 

202021 tagcttcttg cacttatctg ttgcaagaaa gggattgttg actccctatg cgggaagaag 

202081 aagaagaaga aaagaaagag tgcgtggtac tgaaaagaga actatgactt caatcgcctg 

202141 gattatcagg atttacagag ataatccacc atagatttgc gatggccata atgggcaaca 

202201 gatatgatgt gagcatcaaa atattcatgg aattgctaac gtagaggtcc tagccagtca 

202261 aaatagcaag caatgtcagg agaatgcata tactgcttgt aaatagaatg tagaatacac 

202321 atacttcgtt accgcaacgg gttaatgtga ttcgcttcca atttagccag aaccaggccg 

202381 agaaggctat aatgaactca ttaagagcag gggtccgatt cgattcgctc gtccatccta 

202441 atacatcccg ttgatgataa gcagcagaga tatccttcag gctggtgatt cgatgtccgc 

202501 ttgaggctgt tactgataag ccttgtttca tactgcaatg gcatcattca acttcacctg 

202561 tcgactcccc tcccctcatg gactcttata tatgcacctc attcaaaagg accatgcatt 

202621 gacaaaattg cctcatggct atactaacgt actcataatc aatataatta cacacaatga 

202681 atctcaagct tacccagata gaccttgtcc ctgttcccga gaagcaatcc gccagcgaca 

202741 tcgacagcct ccccgttgag caacatggca cgcacaccca ccccgcgcct acttctgacc 



202801 ctctggatcc gcttaactgg ccccgttggc gaaagcatgt gattctgggt attgtcatgc 

202861 tcaagtttgt actacctgag cagtccagaa gaactcacag attgacaatg aacaggtact 

202921 tccttttcac ctacatcaca accaccactg tcccctcctt cgcagaaata caatcacagt 

202981 atgacatcaa ctactcacaa gtcaactgga ccgttgccat cccagctctg ggcctatcgc 

203041 ttggaccact cttctggtcc tcagttggtg acatctacgg ccgccgcatt gtcttcattg 

203101 tgggaactgt catcgcttta gttgctacaa ttggcgctgc tgtggcagac acatatggtg 

203161 ggtacatggc tgctcggttc ttccaagggt ttggagtcag cccttcatca acagtgggga 

203221 tggcagttgg taggtcactt ctcttcagca tgcattgcac tgatccaagc agctaacggc 

203281 acagtcactg atatgttcta tgaatacgaa cgtggccaga aactcggctt atgggtactt 

203341 gctcttgatt cgggtctgct tctaggccca acctgtacgt gagcccattc actgaatcat 

203401 tatgaactcc acctattgac gttaatcaag tcggcgggtt tcttaatctc gtcagcgcac 

203461 aatggatcaa ctggttcaac gccattctct tcgctgcact tctactactc gaactcacac 

203521 tcatgcctga aaccctctac ccgcgcgctt tgatgctcca gcgcatggca gtcaccgaga 

203581 agccagcaga gtcgaatgca ggtatagaag aagcagggat aaaacgcacc aagtcgcttc 

203641 cgttcttcaa cctccgtccc atcccaggtc tctctcatcc acctatctat gcctctctta 

203701 cccggttcct ccttacattt cgcttcccag tcatcgccgt agctgttatc gggtactcat 

203761 tcacatggta ctggtggatc ctgtccgtga ttaccatggt cccgtctgca tatgcaacgg 

203821 actcgcctct catacaggga ttactattcc tgggcctact gatcgggacc ctagtagcag 

203881 aggtaagctg ctcgggccga ttgagcgatg ccatcgtggg gcgattagca aagcggaacg 

203941 ggggagttcg ggtcccagaa atgcgattat ggctagcata tccagccatt gtcttgactg 

204001 ctggtgtgta catccccata gctcatgtgc gtacaatttc taacatagct atacaaccgg 

204061 cagttggttt aatcctctgg ggcataagta tagacaaagc ctaccactgg atggtaggac 

204121 aggttgcatt cttcttatgt gagtagatcc ctccacctat ccactcctcc tctcctctcc 

204181 ttcccttgca attaccatct actaacctca tttactcatc caacgagcag tctccgccgg 

204241 aatccaaatc ggcaacaccg taacaagcag ctacatcatc gacgcctacc cgctccagtc 

204301 caccagcgtg gtaattttct acgctgtatt cctgaacctc agtgcgtttg ctaatccagt 

204361 atgctccctt cccatccact actcttgctg cgatatcctc aaccaccacg agtaatttat 

204421 taacatatga acagttcttc atcgcccaat ggcaagccgc aataggctgg acctggacct 

204481 tcaccaccca agccctgatt gtcgtggctg gcggatgtag tgttttcatt tcgctgcatg 

204541 tgtttggggc ggcgtggaga gcgaaggggg tgccggggtg ggtgaatccg gagtttgatt 

204601 cttagtttga ttcatttttc ttttctatct agagaatcct cgaaacaaga aatgttgcgt 

204661 tacatgttac tgctgagcag aaagttacgc atcgcatcgc aggctttgca tgtagagggt 

204721 gccattggcg tcaatgtcca aagtctctaa tgaacactac gacgtggagg cgtaggtagt 

204781 atcagtacag aaaagcgaat tattatttac ctctcctgct ttaaggagac tcatgcagca 

204841 tgcgcagatg tatgcatata ttcaataatc gctaattacg atgaatcgca tgcaagtacc 

204901 aagcaaagtc acgagtcgtc caaaacgtct gctccacgaa ttaaataaca aaacagaccg 

204961 tctatcacga ttgccatttg cctggtatct tcccattccc accaaacgcc acccaaataa 

205021 gagaagaaca acattttgaa cacgatacgc cgccgccaac gtgatgagat cgacaacgca 

205081 gaaaagaaaa atgctataca tgaccgaata gcttcagtcg atagatacac gtgtgattgc 

205141 cgccccagtt cgacgtggcc cgaaatacaa ccctgtccac ccgaacttct ggggagtcga 

205201 tcaccgcgtc tagttcgaat cgctgcacgt ggtgcgagcc gtggatatcg tatgtgaatt 

205261 tcccgactcg gtaatatgtc ggtcctagga gcggatcatc agagtacgca gtcggatcct 

205321 cggtgggcca cgcggaccgg agggcgtcca taatcggctt gtggatggaa aatcgctccg 

205381 cttgcgttct tgcaggtcgg ctggttccct tctggacgag tacgtactgg gcccagagct 

205441 ccatctcccg gggggcattt tcgggcttca atgtagccgt ctttggcatg tgttcgacgg 

205501 ctacttcctc cggtacaatc ttttggccta ggtcaatgcc aatctgcgac atgccatccc 

205561 ggggtgtgct gcaccagcaa tctccataat cttcccaggg aaccaaagcc gcgagcggtg 

205621 gtgatgcggg acgataatga ctacctctcc atagattaac atatgcccac tgccaagcac 

205681 tgagcttgtg tccgactgtt ggacttgtga gacctggaat aacaataact cctagtccaa 

205741 tgctcagaaa gtttgttttt tgctccgttg gaggaccaag atcggtgctt cctctgcttg 

205801 gcggcatgat cggagtgggt gatgatcgca cctccgattg cacctcgttc acatcccatt 

205861 tgacggtgcg catttccttc gccagtgatg atacctgcgc cccaattcga gacatctggt 

205921 ccgtaaggct gctgactgct agcatgttgt tttcatccat cggaatatac tgtggcggtc 

205981 gagaaaggga gggaatgccg ccaaaggcaa ccaggtaaac tgtcgctgct gccaaagcta 

206041 ttcctgcgag tatacgagcg actgtagata cgctgggcag ctgtgctgat agacttccta 

206101 cccaggacca ggactggcct gtctgacgaa tattttgctg gccatcatcc actgatggtt 

206161 cttgtgatgg agtcctccgt gcatatgcat tagggataag tgtctttgtg ggcgcggtaa 

206221 actgggctgc aggagacttc tctctgataa cctccggctc cggcgtaggc tctggcatag 

206281 gctccagctc tggaataggc tctggctccg gcgtaggctc tagttccagc tcttgcacag 

206341 actcgggttc cgactgtggc tccggctctg gtgaaggctc tatctggaag tcctcgtagc 

206401 taggctgtag ggagggcgat ctcgattttg ttttcagaag ccttgagtcc ggctcttctt 

206461 cttcgtgaat aggctcgttg gcaaaagcca gtccggaaaa tcgacgaggg ggtgccttga 



206521 tattctttcc atcggaaagc gtccgtgtca attcatcgga gttgatggct cgctcaacgg 

206581 tgaacgaata taacgacgag tcgtccggta gcggggagct tcctaacggt ggcccgtcgt 

206641 gcaattgtgg gccaaaccgc cccataggag atggatacag gttttcggct ggtacgtggg 

206701 gatatctggg tacaggtaga gactgcgcag gacttgaggc ggtagaaaga gtgtgcggga 

206761 taggaggcgt gggggtggcg gtggattggt cgttgttgct gtagttgccc cgtacaggag 

206821 taggagactt ggttgcctcg cgcaggttgt cgagaagctg caactcatcc ggggtcggct 

206881 cccgtcgact ccgccgaacc ggggaacggg taatagaagg ggacttctgt cgacgagatt 

206941 tgtcctgggt ggtttctttc tctttcatct gtcgatcctt agcatcctcg attcccttgt 

207001 cgatcgcgtc ggccatctcg gtcagatcca tatgaggctg aatcttgagc tcccgcggga 

207061 ggattggcgt ctcagcagat ccataagcga acgactgctt ggttggaagg tttggcagag 

207121 cctgtgtcct tgcttccgga cccagtttgg actgaaaata cgtcgacgca ctccctatgt 

207181 cggagcgagt agaccccgcg cgcctggtgg cgccgcgtct agcaggcatt ggaagttcaa 

207241 tgcggagaag aatcggcttg atgtttctgt tagctagagc gcataaagta cagctttgca 

207301 tactgagggt taagggaaga gaaagtccta tagcagtcgg actcaaagct ctctactcac 

207361 tgtgagaaga tggaactgaa taagacgctt gcagccttag gcataaatcg agggctggag 

207421 atgccaaagc actgcgatga agtaggagag atccagttca gagagcgcga tcaggaataa 

207481 ttttgaacgg gcagtctgca atcgagcttg tgctacacta tggctgcgac caggatgacc 

207541 aattcagatg tatcctacac gaggtacgtt agagatgaaa tgaagaacaa gtgaaagaaa 

207601 gctacctcct caagtctttg ttcgcgaaag accattgaga aaagtcaaag acttcgaggt 

207661 gcccggatga gctaaggtga gaacaggcta tagactcacg atagcaatcc ggggtaattg 

207721 tcgaccttca agcgcgaaag cgacacggcg cgaaaagacg cgggtggaat gccgatccac 

207781 agcgtcgcag gggcggttaa gggattcaag cacgtgatat ttaggataaa ccaggttgcg 

207841 catgtatgat gcgcgtcgta caaagattcc tcagtgggag atgatgaaga gagagaatgc 

207901 tgaagccgca aaacaatgct agacgacggg agatgtgagg cttgcttgct tgcttgcttg 

207961 ctggactgaa tgatggatgg cccgaacctc cactcgctgc gggtaatatt cgtatctccc 

208021 gttacccgcc ttcaatacag gattgcccga ttcggcgagg cggctcgtcg gcctccttta 

208081 tagatcaccc tttggttctt ataaaagtta tcctctctca cgcagagtgt atagacctta 

208141 catgacagga ttattgcatt attgcactgt tgattctttg tctcagtcta ccatttgatt 

208201 gctatgctca tcagtccaac caggccttta tttcttcgct atttttatgc atgtttcatc 

208261 gtacaaagaa aaatacaggt cctcgaatgc ttcatttaca cggctgccct cccaggacgc 

208321 ccatccttct atattgccaa agtaagaata atatacagtc gctgaggcag tcaagcagtc 

208381 acaaatacag aaacatgatg atgatagtta gcatgtccat ggcaagtctt gcaggtcgta 

208441 taggcttcgt ttaacaagtc cacgccgcga caataccgtc cttgcggccc tcaacaagga 

208501 atccatcctt gagccggacc cgctcaaccg tggccggctg cggtccactg tcatcgacgc 

208561 caaccccggc gccacagtcc cagtggtggc cgtccgcctt gtcgtatacc ttgacaacgt 

208621 tctctccggc tgccgcaaca atgcgcttag tgtcgaacat catgctagtg atgggcttgt 

208681 cgtacgcgta ggcatcgaag atagatccag ttcggagatc ccagatctgg ccaatgttag 

208741 cggtgccgtt acctctgcat ataaacaaaa cacttactct gatgctgcgg tcctgacttc 

208801 cggtcaccaa gtgtacatcg tcgaactgta ggcatgtaat agggccggta tggccaacca 

208861 ggctccggtg gacttggccg ctgcgaagat cccacagacg caccataccg tccgccgtac 

208921 cgcaggccag cgccgcatcg aaacactgaa ccgcgccgac aaagtccgca gaagcatctg 

208981 gcaagcgtcc cgatgggcgc catgtcgtct cacttcctag agtcgacgcc tgggccgctg 

209041 cccacaagac atccagcgtc tgcacgcagc gccctttgac cagatcccac tgtcgcaacg 

209101 tcttgtcggc agatccagaa atcagagtgt ctcccttgaa gtgaagagcg gttacttcat 

209161 cgacgtgcgc atcaagcgag tatacgtagc agtcttccag ggtggtcgag tgagatgcca 

209221 tggaggcgtc atcagcttgc gcggattcct catcgtcctc gcgccgggtg acgcggttgt 

209281 cgcgagttgt ggtccgggcc cggctcagat cccagagctt cacggatgcg tccatagatc 

209341 cagtggcgac aatgttgtcc tcgatctgta ggcagcggac ggatgcgttg tgtccctcca 

209401 ggaagcccac gcagcggccg acgttgagat cccacacccg caccgtatcg tctagggcag 

209461 cactgatcat ggtgccaaac gggtagtcga agtcaatagc ggtaaccatg tctgagtgcg 

209521 ccggcatctc tttgatttca gaaccgggag caaagtgttc gtgcaaaact ggcatcgatc 

209581 gtttcccttg gaccattagt actggacact gaacggggaa tgctttggca tacttataga 

209641 tcgctgcttc acgctcttcg gcgaagggat cttctggtag atggattcgg acataaagga 

209701 agcacccatg gctgggtctt cagaaacaat cgagtcatcg ttggcctcgg aagacttggg 

209761 agttgcggga ggcggcacct cggcctcttg ttgctcttct tgaatatcct ccagtttgtt 

209821 ctcaatctcg ttcactggtt ccatgttaga acgtttctgc cgagtatcca agatttcagg 

209881 tcacccacgc tcatgttcca gatccgcctc ctccatctca aggcctgcca atcggtctag 

209941 cacaatcttc cgcatgttgt gtaggttcgc gatcttgtta tctatgtcgt gtatctccga 

210001 gctgcacata ttcttgcgaa cgcccatcag ttccatccgg cggctcaaca agtcccgttc 

210061 cttcttcaga tccgacggag cggacggcag ggcgccgcgc gtctccccat ccttcagcag 

210121 cttcttgctc ttggaagagc gccggcggtg ggcgcggcga tactcgatgt catcctgcga 

210181 cgcctggaag ccctggaaga gagaatatga gctggagtca tcgggaatat tggcgaggag 



210241 ttcgtcagga agggaggaga tggcccgcga ctggatttcg gtggtggaga gcttggagcg 

210301 gaccagatcg gtaggggtgg tctgggtgag agagaagact ttgcgctgag tgttgggatg 

210361 gcgtaattcg cgctgaatgt cggccatagc agtgtggtag tggccgccgg tggaggagtc 

210421 gggggccggt cccatcaaat ggccagcagt cgatgtgact ttgcgcccga aggcctcgat 

210481 atgacgcgac tggagaggaa gaaggtcagc ggaagggatt ttaagcagga aagaccgccc 

210541 aagggcaggt gtcaaggccg caagagcata ccgtgatgcc ggtccctaag agaccgtccg 

210601 gctcgacaat atctgataga ccggagggag actcatcacg gcggcggtgt ttgtccatga 

210661 caccggcgaa ggagcgaaga gtgagctaag cacaaactcc catccgggaa ttaacagatc 

210721 aaggacaagg atggcggaga aagggagctg ttgtgcccag tcggcagttg gtcggcttca 

210781 ggctgggatt ggaaaaagaa aggcggatgg agttgcacgc gatgaattac cgtgtttcgg 

210841 ctgtatttta cagcttgcct ttgctgcgcc cggctaccac gggcacactt ctctcagctc 

210901 tggcttgtct ggcttcaacg tccgaaaaag catattcgtt cccctggaaa gagggtgttc 

210961 cattggatat ggctggctta agtatccaaa tgtccctgtt cttttgcccc ccagcccctc 

211021 cagctttcca cctgatgagg ctctccttct ctgcacatga cagcatcacc agccacggat 

211081 cccgctgcgg gttctgtact gagccgtaat gttacctgcg gacaaccaga atcatgaatc 

211141 accatccaaa tcatgggttg cagatgatgg ctgagcccag cattccatga caatccatct 

211201 gtcgtgagac ttctttgctt ttttcagact caacaaatag tcgcccttga agcatcatcc 

211261 agcttaggcc gatcagccaa acgttagcgc tgactcccca gactacttaa gttgctctat 

211321 gatctccccg aaagcggagc gttttcatct cgctttccca gtccctcaat ctacttctac 

211381 ggttgctgca cttctgcccg ctacgcactc tacaacgctg tctcgatttc caaccagcct 

211441 cgccatggat acagccattg atctgtccga tgcctccaag gccttggacc tcgccaacat 

211501 ccgcttccag ttaatgtaag tgtcttccga gggatttcag tagttagtta gcctactcca 

211561 gatcgaccga agatcgggga accaactcgc atacaataat tcacatccct gcgcgcaaag 

211621 ttcaaccccg catccccctc cattactgcc agggatataa catcaatgtc taccaacaca 

211681 aacacctaaa atcaagcaaa aaaacaacat gtcagtaact aatcatgcat cgatacaatt 

211741 agccgtctcg aagacaccat caccttccat ctcattgaac gggcgcaatt ccccctcaac 

211801 aagcccatct acctatccgg cggggtaaaa atccctggca ccgacctcag tctcctcgac 

211861 tacttcctcc gcgaacaaga acgtctggag tcgcgtgttc gtcggtacca gtctcccgat 

211921 gaatacccct tcttcccgga ctcgctggaa gagcccatcc ttgccccgat caagtacccc 

211981 aagatcctgc atgacaacga cgtcaacgtc aatagcacct tgaaacagcg ctacatcgag 

212041 gatatcctcc ccgctgtgtg cgcgcagttc gggcgcgaag atcgcggcga ggcgcaggag 
212101 aactatggat ccgcagccac atgcgatgtg cactgcttgc aggccctgtc gaggagaatc 
212161 cacttcggaa agtttgtcgc ggaggcaaag ttccagcagg agaccgagcg ctttgttaag 
212221 ctgatcaagg cgaatgatcg gaagggcatt gatgacgcta ttaccaagcc tgaggtggag 
212281 ctgaaggtgt tggagcggtt ggcgctcaag gcgaagacgt atggtaccga tccgggtttc 
212341 caacctgaaa atggacctaa gattaatgtt gatgctgttg tggccatgta caaggtgagt 
212401 ttacatcctc cccgcagttg atagaatcag gatcgtgctg acgtgtgtag gaatatgtca 
212461 tcccgctcac caaggtcgtc gaggttgact acctcatgca gcgactcaag ggcacccaat 
212521 gggagtagtg taaacaccta gcaactccca ttttgcaaac tatatacacg gaacatgaca 
212581 cgagacgtcg tgacgaaact tctatagaat taacccaaga tggcataata gaaacggaca 
212641 aagatcaaag aggcgaaaaa acaaagaaca aaccaacttt ttcgggtgtc tataacatat 
212701 aatcatcgta atcaaaggca aattatatct atggcttaaa gtactccgcc atatctttaa 
212761 ccgccccatc caacttgctc gcctccaaca aaatcaagct ccccttcgtc acaataccca 
212821 aatcctggaa tgcgcgcttc gcatccgaca gtccctcctc cgccttatgc gtatactcca 
212881 caggcccaac acacacggca tgcctttcca gaacctcatc ctcaggatgg aagaggaacc 
212941 gctcggcctt ctccccaccc gcagccttct tcttcttctt ttgcggccgc gattcctccg 
213001 catctagctt cgactcgacc tcttcgtacg tcttcgacac gatcaaatag tgcgagaaat 
213061 tgtaaggctc cttgtccttg atcgcccatt cgatttcctc ttgcagcatc gtgtacatag 
213121 gcggtacgac ctctgcgggc atgttgatga gacgttccgt caggatcagg ccgatgggag 
213181 ggatgggagt ttgggagagg agttgagcca ggggtgcgag tgttggtacg gcattggctt 
213241 tgcgctggag gtaggctgtg aggtctttga ttacgggttt gtcctggttt ttgttagatt 
213301 ctcttccttc tatccttatg ccaaggagaa cggagacgca ccttgtgctc ctgaaggttc 
213361 aacacagtca agaacgcata cggatccgac tcattcccat ctactttaac agtcgatcca 

213421 agcagaggtt gcgagaggat caaatcggcc aatgcggaca tgtcgaagat ctgggcgtcg 
213481 gtgtcgaaga gctggcggag gaggttcttc agaccgtgga agtcgacagc cggttgaggg 
213541 tcgaaccatt cgaaatcgac gttgaccatg tccatgtcct gactcaatgt gttaattcta 
213601 ctttgcgtat ttcaggttgt tgcttggctg ggcttgtata ttgcttggag gggtattggt 
213661 atgtatgtac ctcgtcggag tcgtcgccgt cgacgcgggg atccgtcccc gacatctcga 
213721 catcattgtc tttgagttcc ttgcgcttga ccattgtgta tgttgggtat gtgtggtgtg 
213781 gtgtgagggg ttgtgctgga gtaaattggg attccgttcg cggaggttcg tcgggcggcc 
213841 ggccgacaaa agttgatctt ccgctaaccg cttttgatta tcgatagctt atcgataccg 
213901 ttatactaga cctgatacat cattgaaatc aattcatatt cttactttga agcaagctat 



213961 gtaggcactt cacactgttt gaatgagtat tctctagaat gccctattta cccccgcatc 

214021 tcgcatatag gtctacacat cagaggaata tatagaactg aagaaccatt caatgtagac 

214081 agacatttca ctgaccctgg cctactacta tataaatttg tcctgcggcc ttgtttgatc 

214141 ttcctaatct tcccaactcc aaccatctac gtcatataat gggagaaaca tcgtagtact 

214201 tatataattg aaatgagata tctcaatgat ctagatattc gataatatat gcaagtcgca 

214261 gtgagaccaa ttacccatga ataatactaa tatattggca acagacttga tagaaggaaa 

214321 ggagattgtc aaagcgatgg agcttagacc agacaatctg atcagaatat attgggctct 

214381 gaatagacca aatagagttg gttcatcatg ccttttgctc agaatggaga tgcagtaacc 

214441 aagggtacgt caagtggagg ccatgttact ccttcgaata aaacacatga gtgaagactt 

214501 cgcagcggac taccaagttg aagaaggcat caaatatctt agagaggtaa agacgacact 

214561 acatccgttt gttgtgaagg ccccatccgt tggaagctat acatgaacag gaagctctaa 

214621 tgcaaccgct cagcccgaac ggaaatgtat ctgcaggatg tcattctttc tgcgcaaggg 

214681 taccgaaaag cacaggtaca agccaaaata gcagcataac tccttcgttt ggatcctgtc 

214741 ttttgggaaa gaatccttgg tatctggaac agaacaacag aagcaaccca tacagtcaga 

214801 actcgagagt accagagcct ggaagagtat ttgtcctatc gagtgtcgga tgcatatgtt 

214861 gagtaagcac gcaattgatt ctgcccgctt ccactgctaa acttctcagg gtgggcatta 

214921 gatctgttta gctgggcgca ggacattcat ctgaatgatg atgaagagca tatacatagt 

214981 agaccgtgtc gaccaatctc agcttcaaga taatcttggc tgcctcctcc gagcaagact 

215041 ggcatcgcat cacaacctga acagcattga caaggtccac tgtcaggggt tgtccctcgc 

215101 ctgtgcttgg ttacgtgcag ggttcagact gctgctgacg gccatgttgt tctataataa 

215161 tatgagagac ttcctggaat ataagaactc ctcctctggc ccaacgtcca agtgaatact 

215221 acaatgaatg ctgcgcggat gaaccaggta atagatacta tgcctggcag tgataataag 

215281 aaaattcttt gcagctgaca gtttctgagt aactatatac ttaataaaga gaatgatact 

215341 aatatagtct catatcaata ctgaaaaatc ctgatatggt cgatgatcag taaaatacaa 

215401 gaatcagtac cgctaattag caaagtataa agaattagag aagatatatc tccggccttt 

215461 aagagaatta aaatatcaag cacaaaggtt ttaagatata taaggtttat gagactgttc 

215521 actgacataa tatatgtatt ataaatgaag agaatattaa aaaggcggac ccgcattatc 

215581 aagaagttca cccgtgaccc tgaccattgc atggtgcttg taatgagtta tatgataagt 

215641 tcggccggac tcaatctgca ggcgctgtgc cataaccttc atctcttcac cggctaaaca 

215701 tggatttagg ctcgatgaca ttcaggatga gagtgcactc cgtcggggca aaccccccac 

215761 tcaacaagac attcggtgtt gatcaaacaa tcagttgcgc aacctttctc ataaacgagg 

215821 ctttgtgctt agtccggata ttacgccatc cccacatcgg agaaagccat gggcttcatt 

215881 ggaggtatca tgtcaccgtt ccctccctcc aagggaatca tgtccttatg atagacaaaa 

215941 ggaaatgaag tactgcaaca ttaaagagat ccctcccact agtaaatcag cactccgctt 

216001 cttttcatat acgtagtatg acctcgctac agtcccggtc tctatactta ttctaccaca 

216061 ggagtagtat atcatcatct ctaaagtcta tccaacatac atactatata caacaacccc 

216121 aatcacctca aacatccccc tcgacctcct ccacactcgg caccttatcc aacaccctcc 

216181 cataaaaccc agcaacatgc ttaaccccct cacacaaaac cctcacatcc tccaaactca 

216241 cccccaaccg tctcgtcccc cgaatatgcc aatgcgtctc aatcttcaaa ttctgactca 

216301 taatcgccgg caacaccacc aactgcgtat ccaaatcatc caacaccccc cgatcactca 

216361 aaaacaaccc atacgtgatc tccttcgaca accacgcgaa atccttatgc gccgcaaaca 

216421 aatccaacgt cccggacgta ttccgcgcat acagcttctc caaccacgct acgccacgct 

216481 cgtgattcgc ctcgtcgcat tgccagttct cccgtgtaaa tgtgtagtct ttgtcttcgg 

216541 gagcctcgag cttgctgatg gcgagaatgg cttcgatggg tttgcatacc ccgactatga 

216601 tcacggattt gatgagggct tcgcggatgc ggcggatgag gtcttgccgg gatgaggggg 

216661 tcgagtattc gggtttggag gttaggtaga gatagagttg atcggcgcgt tcggggtcgg 

216721 ggttcgcgac gagggtggag atggctagga tgtaccattt gtcgtgcggg agggtggtgg 

216781 tttggaattg ggattccaag gattggaaga gttgggcgta tttggggtcg atggttgtgg 

216841 tcattttggt gggggcgctg ttgctgctgg tggtgatggt tcggagggga gtagcgaagg 

216901 tgggaatgcg atgcgcgaga ggggaagatg atggtgttga tgatattaat gatgatcgga 

216961 gcggcctgta tcggggcggg gagaggaggg gtaggatgga ggcaccggtg ccgaattgcc 

217021 gtttactagt agtcaattgc agactacgag acggagggag cagacgcgaa gggttgcggg 

217081 ccattatgca cccttgaaga tgcaattatc cttagtcgtg gaggtgagtc gcgtgtatct 

217141 gtttattctc catgttagcc ggaaatggac ttatcagccg atatcggtta tgtaagagtg 

217201 catggcatgg ctgagttcaa taatgctcct tgacaccacc ccattcagga acaccaatgt 

217261 ctcatctggt tgcaccccaa gacttcaccg aaaatgcaac accagacgat ccaactacat 

217321 catcgaagag gaatatgatg aggctgttcc agccaaggga caagcatgcc cacctttggc 

217381 tttgacccac acagccatgg gtatgatctc agcgactgcc atatgacatc acaactatca 

217441 aacccagatc ccaactctac tctatataaa cgctgttcat cctcaagcaa tgtaactcta 

217501 ctacatcaaa caacgaacac tcatcgaatc tctctctctc tctctatata cagaaaacaa 

217561 ctcactccaa tcctacttca caaatacaca caatgtcttc ctccggagca ggccaatcca 

217621 gcgtcggcag ccgcgccatc tacgaagccg gtgaccagcg caacgttccc cagtccgaga 



217681 tcaacgagca gaaccgctac gcagagggcc agaagaacag ccacaagaac cttgattcga 

217741 gtatgcctcc gtcttccctc ctaccttcca atcactcggg agacatagtg ctgacgaata 

217801 ccaaaacaga ggaccagcgc tccatcggca acaagcttgc ctcgcaagaa cgcaagcctg 

217861 agagcgacca ccaccacaac tttgatcaga accctgaggc tgagctcagc aagcaggatc 

217921 ctactaagcc ggtatgtccc atctcatatc tatatcttcc ttggttgaac cgtgtgggaa 

217981 gcatgactga ctgtctctgc aggccaaggt gcatggtaac gagccctcca agggtgctaa 

218041 gatcgatgcc gaattgcagg ctgaggatga gcagcgcttg cgtgagaagg gcattaagaa 

218101 ataaatgatg taatgatggg ctgggatgga aggaaggatg gaaaagcgat atatatcctg 

218161 ttcgaatgac tgcatgaacg attgacgcct ggggcgatag catatcacta ttattcaatc 

218221 tacgatgaat ttattactat tctcttcaag tgctttttag gggatgtagt atattaactc 

218281 atgtgtgttg aatgtcgtta ctgggaaatg gaggtaattc gttgcggagc gccagggtat 

218341 ccaacaagac aaacggtagg atgctaaata ccgcatccac caatattata catctaatga 

218401 tatgaatatg taaccagaaa aggaacagaa ggatatatac aatattgctg ataagcaacc 

218461 gtgtcgatca taaacagcat ggcgtaaatg gttacccagt gatgtctcaa ggctcagcat 

218521 cgcagtgcga tacggccaat ttcagctttc gtcagtgaac ttggggtcgt catcatgcgg 

218581 ctcggcgaaa acgttcgccc agtagctctc cactctatca caaacttcat ccttcctttc 

218641 ccatgtccgg aagctaatgg cagggtcgca actgatgttt ttgatctcct cccaatacgc 

218701 tttctcaacg tcgtatccaa atcgctggac ggtagattct gccaccagcc atgcccgtcc 

218761 gtccgtgcca tcagggcgga gcgtgggggt ggtctcccat ttcatgccaa atactttttc 

218821 accaggattg aacttggggt agttggcgtg gatgaaaaag gcacgtgggg ccttggccac 

218881 agactcatct tgcacgcgcc atttgcccgc actggtcaac gcataatcct ccgctgggtc 

218941 ggtctgcacc atcgcagagc ccgcaatgcc acccggcttc gtgtgaccga ccgcctgaac 

219001 ccgctcgctc acagcgtaga aaggttctcc gacagcagat gcagcgtgaa ggaaggtctc 

219061 cttatcgcct tcgccgggtg cgccttgcga caatagccgg aaatagtgcg acgggccgta 

219121 atagttgtaa taagcagcga gaagcaaagt catttgatgt gtctttttcg ataccaagaa 

219181 gacgcccgct tccgatgacg cgcgctccga cagcgctggc atttcctgcc gcgagattgt 

219241 gaagtagagc ggggacgccg aggaaatcca aaagtccggc catgtgacta accccttgga 

219301 ggtgaatggt tccgactcaa gcaaaacttc aggcttgtgg agcggaaagc agtccgcgtc 

219361 catccagatg acctcctcga aggaggagaa taaaaccgca aaaatcttga gctgataatg 

219421 ctcaatggca cccttgccaa caatgtccga gaggacaaga cagcgcgctc ccaaatccgg 

219481 gaggatgtca tcacagatct gtttttcata ttcactttta tccttcatat acacctccac 

219541 cggcaatgtg gaccccagtc gtcgaagcat tcgcaaagag gacaagaaca cggggaaata 

219601 ggaagccccc gccgtggaca ctatgccacg cgtacccggt gtatgtactg gtcgcaaccg 

219661 ttcagccttg cgacattctt caatatatcc cgcatgagac tcttccattg cctgcaaatc 

219721 cccatcctct atgattgtca gatcaggtcg ggaatcggtc gtcgtagcgt tgaagtggac 

219781 tgcgctcacg tctgccaggc gcttcggagt cggagtattt ggggcatgtt cttgcaatat 

219841 cgggttcaag accttccaaa aagcattttg acgttctgcg agagggactt cgaagcggtg 

219901 ctagaactac tgaagccgga ggcggggccc aatagatata cacggtgcaa atgagtaaca 

219961 gcgcacccgt ggctgatatt atcgatttca gtttacctgg tcgaagaatc ggcatgacta 

220021 tgttcgctgc ccgggggcag agctaggggg tatggaaata tggctttaag ttggaggcaa 

220081 atctaccagc tcgttgtgta tgaagggagt gcgcttcggc ggtcagaccg gcagggtgag 

220141 ttgttcaagc aaagcgcaga gaaggtcgct ttagcaatcc atcaacatgg ctggcaagct 

220201 cgggtctagt ccaggtacag cgggagggcg ggttctggtt caacggggga caggaaaggc 

220261 ggtggaagag ccgcggtagt caaagataag gcacagtcaa gcgctgttgt gttggcaaga 

220321 cctggaaatg ttagaatatc gccaagtttt ggccaggaaa ctttccagcg accacaaagg 

220381 cttgtccaac gtggctcagc aggcagcgat gggggaggcg atggaggcgc cgtggatgct 

220441 ttttccagca ggcggtggca gaacgagcac ttacgttgcc cgccgctatt gataacgcgg 

220501 tgcataattg agtatgccgt ggctccggtt ggcttcccac gatcttcacc cggtggcaat 

220561 cccaatatac caagtgcgag ggattttcct attccatgct tgtggtcgga atcttcccga 

220621 ccgatccatt aggggttcca aggcaccgca ccaagcctcg aaccgtagca gaggaaaagg 

220681 ggaggtgaag actgaagact gtactgagta tgtggccctt cctccgcgga agggccagtt 

220741 ccaccgaacc gacagagtgg aaagtggaag ggaaagttca ccagtggatc ccattagctt 

220801 ggtctaccga aaccttaaga agttaagggg aatctggaat cttggatcga ctgagaaggg 

220861 aagactggga gaaggaggga gagggagagg aatgaggaat gagatcctcg gctcgaggag 

220921 agttgatcgt taagtagtag taagtagtaa gtagtagtaa ttagttgagt tgctggctcg 

220981 tttttgtggc ttgttacatt gaagaaggac aatgcaactg cccgctagtc ctgttccggc 

221041 cgtagattgg gggcagcaac tccttccact tcaatctttc tcaactactt gaactacttg 

221101 aactacttga actacttgaa ctgcttgtag gatatctcat ctgtgcaccg agtcgagttc 

221161 gtcacgccaa ctctgcaatc tgcagtctgc actctgcacc ggctatgaac tgttccttct 

221221 cctctgggtc catctgaaac ggtggctcaa tttcgccgac actcggccgc catcaagtaa 

221281 gatcgctcca aaatgcgggc tcgaacacac tcgccacgca ccggttggct gctgagcctc 

221341 gactactcgg cgctaacgag tttatactgg cttaatctcc ggcgctgctc tcctaacggc 



221401 gggtgatgcc gcaacgatcg tcgcttatca tcatctccaa atccgcttct ttcctccagc 

221461 gcgaactaga tccatccccg atgtgtttcg attataaacg accgtaagcg tggtcctccg 

221521 ccgccccccc tctccgatgt tagctcttct ttcttgcttt ttcctgacct gttcatcccg 

221581 actactagat acattgcctt ccacaccctc ccccccccaa caacgtgttt gcgtgctttc 

221641 gtctggccca tccataacgc actagctagc ctgctactgc cgccgtgtca actcgtcgta 

221701 ccacttatgg gaccccgcac tccgggttat ctccacattg tttgcctgga ccagttagta 

221761 tctgcaccct agcagttggc caacctctgc gtcggatccg gctatggaac caacctcgaa 

221821 ggctcgaatc cgcaaaactc cgccggtctc cgccccagct ccccggaaaa ggcgtaggag 

221881 gacggtcgta agccaagcgg ctgatgattg ctttacttgc gcccgacagg gtactccctg 

221941 cgaccggcgt cgcccatttt gctcccagtg cttggacagc gcccgcaatt gcgcggggta 

222001 caagacagcg ttgacatggg gcgtaggagt tgccagtcgg ggaaagcttc gaggattgtc 

222061 cctccccgtg accggggcac agcctgcggt cgcacccccg aagcttcccc gtgcgacagg 

222121 tccacgagca gtgtccaaat cgcaaccggc tcttgacaca gccgcagaaa aacgacgact 

222181 tcatgataag cagcgttcat cagccgccgg gcttacagag catgggtctg tctcgattcc 

222241 acctacttca caacgccgct tgtgctcacc ggagtactac actgtggatt cccatgttaa 

222301 gccgaatcac cataatcctt attcttcacg tgcgacaaag accgattctt tgaaacaata 

222361 ccatgcccca ttggtcgcaa accccgattc atcaagggcc tggatggggt tagaacagac 

222421 accggtaccg atacagtgtc cgtggccagc cctgactccg gaacagaaag ataatgcttc 

222481 ggatgagcaa catgaagaaa cctcttataa cgaggtttca ccacgtatga gtgcttgcca 

222541 gccgccatca ttatctcagc agctgctggc gcgttccgtg ggacgtacac ctcgacttcg 

222601 atacctaatt agttattatg ccgaagtcat tgctccgatg attgtggcat ttgatagtcc 

222661 aaccaatcct tttcgaacat atatcctgcg actggcgcag gagagcacat cgcttcaaga 

222721 agccatcgcg acgttggcaa gctgcaatct gcgacagcga cgcgaacgag gcacgaggtc 

222781 gacggagcga actttgccgg ggcgcctgtc gtccatggcc catcaagcct tgaccgacgg 

222841 agcattacag gaccggaccg gtatctttat gcctgaagat tatgcccagg aagagcagta 

222901 tcatcgaggc atggcggtct ccgccttgaa tcgggaattg gccgactcac atcagaggct 

222961 ttcagactcc gtcctggcga ccttactcat cctctgtcta ttccacgggt gtgatactgg 

223021 ggtggctcaa ttcaagaccc agtttgcggg ggtgacaaaa ttgttggcta ttcgactgtg 

223081 caactcacct tgcatgtcgg atgaactgaa gtggtttgtt cgcatgttca catggattga 

223141 cacgatgacc gccaccacca atgaccgcga ggtacagctt cgcggtgctt gcctggatat 

223201 caccgccgcc tcagatggcg aatggggact ggaaaatctg gcgggctgtg atgccggact 

223261 tttcagaatt gtagcgcagc taggtcgttt gaaccttctg agtcaaaatc aagaagtccg 

223321 cacatctacg ccaccagaca tccatatacc atccactact ctcccgccct cgatggcgtt 

223381 cttcccatcg aacattccag ccacctcgtc cggtcctttt gcatttacct tgccgccgct 

223441 gccgcctcct ggtgatcgcg acagaggaca gctgccgccg gcgttctggg ctgaatggta 

223501 ttctctccga cagaagcttg agtcctggcg atttgttccc caccaatcgc agtcacagtc 

223561 acctctcttt acatccagcc acgcctacat ctcgccacct tcatcgccga cagaccagcc 

223621 ggtggtcgca gctcacaatc tcaaggacgt catacacatc tctgagtcat tccgtcatgc 

223681 ggcgatcttg tacagcgagc gactcgccta tccagaccta ccgtccgatc accctcgtat 

223741 tcaaaacatt gtgcaacaca taatgagtca cattatgacc gttcaatctg acgcgtatct 

223801 cctgtggcca ctattcatca ccgggtcgga gtgcgtgcgt gccgatcatc gcgccattat 

223861 ccgccaacgg tgcaaagatc tatccaggga ctctgggttc ttcaacaacc tctcatgctt 

223921 gcagctcctg gagaagatct gggcagagaa ccctgccgtg gatgtgatcc ctgttgtctc 

223981 atcccttgat ttcggcttgc aggcagccgc cagcgggtgc gtgagcgagg ttacatttct 

224041 cccgcctgga agctgggcct cgtcccagca tgaggctttc atgtctccga cactgccctc 

224101 tgtctctcgg aagcagggat tccgctggca cgaggcgatg caaaccaaac gcgcggagga 

224161 cgagtatatg actgtatgac gtgacctgac ggatacgtgc atcaaaacga cgagtccagc 

224221 atgaattgcg aggaccaatt ataatataga ggtatcatat tagcgaaata taatgaacat 

224281 aatgcaacat tcccccacta cgtacacagt actgtgtgcc ctcagtatct gtctatggca 

224341 acctgaacct tcgggccaca acaggccccc actatcggac ccatcgcact agatccgttc 

224401 ccacttccca cacaatccgc tcgatacgta ccatcacgac agtaccctac tgcgtaaaat 

224461 tctccgtaaa cctactgtca tctgtcagaa ctccatcgat gatgcatcta ctccatctgg 

224521 tccctctagc aacctgcctc accagcccac ccaaacaaac cacaaaagta gcggcaccgg 

224581 gcatctcccg agcgccagta cgtcggaaaa tttgtgttgg aggtgagttg agtttcgggt 

224641 ctgcgtgtca gatgctcatt tgtgggtgtt ggactctgag agggtgtcca cccatcactt 

224701 agatagtact gtgccgctgt ggtgatagaa tgagtgggta agtaattgcc taactcttgg 

224761 agagaagggg tcttgttcac tttgtttctt gaattactgg tttggtatgt cggagggggg 

224821 gtatactgtg ctgcgccagt atactatcta tggctacata gggactaatc actacataca 

224881 gtagttagca tgtagttaag gcacagtagt ataccactat tatagtaata gtagtggcag 

224941 tggcagtgac ggtatcgcca ctggtggact aaatattact atctatagtt agtattagat 

225001 cccactaata gtgaacgctc atgcacatag ccatgctctc tctctacatt attaactaac 

225061 cacatagtta atactatact acatcttaaa taatacctca gtcagatttc gtcatcccca 



225121 tcaccggata gatggaagga agatagaaac ccacaagcgt atcgctaagc gtgactctga 
225181 ctcgtctcca caagacaaga gaacagattt gcaacgtgac ccaaacttac ttttactttt 
225241 gcggatagaa gatgaggggt tcactttaca tcatcaacta gtacacatac agtaatcaac 
225301 ccataatact agcaggtcat ctgcctctga agaaagacgc ggggggcgtg ggttgggatg 
225361 gataatgcca atgtgatagt gatgagattc atcaccttcg gcggctattc gggggtctta 
225421 tccccccctg gttggtatct ttcccgtgcg gcgggggtga gggttaacta gctacgccgt 
225481 ggagtgtaga acaagggtgt tgttggggtt gggttaggtt ggcggttaag gtaagtgagt 
225541 aagtacatta gtagtaaggt tgaaagtgat gttcggtgga ctgttgtaga ggagattaaa 
225601 aagaggggta gtctagagag agggggggat gacatggggt agagatacta cttacttact 
225661 atctagacta gaactttgat acactgtggt ggtggtggag gaggtatggt atgtacatga 
225721 ttttcttagt ggtatttatg ggagggggag ggttgggtgg gtacatcgtg ttaaatactt 
225781 gttgtctatt tggttacttg gtggtatgtt ggcggtgggg aggatataga gggagttgga 
225841 tgtggattga atggtagggg attagagatg atttcattaa aatgatatgg aggttgttgg 
225901 tttttgtatg ttgtgtggtg taggaaggac tggtgtgaat tgttggttct gcgtataggt 
225961 ggtacgctgc gattttgaga gcgtagcaga gaggtaagta gtatatgtag tatggatatg 
226021 tggttgttcc tggtcgcttc tggatcgcat atagtcagtg taacatatat tatggcggta 
226081 aatattctaa caatgacata attctgttat ttcacgaggt tccaaaattt agatggtata 
226141 atatgtgaat gagcaccttt cctgcgcaca ctgcaaaaca atgaaacttc tattggaaat 
226201 tatccaatac cctacaagag tagaatatat tccacgtgca tccctggcat attgacgtaa 
226261 cctatctacc aagcacaata catccatgta cctggttcat ggactgtagg tgtctagagc 
226321 caaacgcaaa acatcattgc agcggttaca ctcatcccaa gtagtacaac cacagtgcgt 
226381 attccttaca tagtaacgta gtactcccta actacctaac caaacctaac ctaaccaaca 
226441 cgatcacgat cctccacacc tgcggaagcg gaaacctgca gacgtgtcgg gcttgttgac 
226501 tta 
LOCUS (LOC) : AE014133 GenBank (R) 

GenBank ACC. NO. (GBN) : AE014133 AE014853-AE015037 
GenBank VERSION (VER) : AE014133.1 GI:24378526 
SEQUENCE LENGTH (SQL) : 2030921 
MOLECULE TYPE (CI): DNA; circular 

DIVISION CODE (CI): Bacteria 
DATE (DATE) : 24 Jan 2006 

DEFINITION (DEF) : Streptococcus mutans UA159, complete genome. 

SOURCE: Streptococcus mutans UA159 

ORGANISM (ORGN) : Streptococcus mutans UA159 

Bacteria; Firmicutes; Lactobacillales ; 
Streptococcaceae; Streptococcus 



COMMENT: On or before Jan 18, 2006 this sequence version replaced
gi:24376380, gi:24376394, gi:24376399, gi:24376411, gi:24376427,
gi:24376440, gi:24376449, gi:24376457, gi:24376467, gi:24376477,
gi:24376489, gi:24376500, gi:24376508, gi:24376518, gi:24376529,
gi:24376540, gi:24376558, gi:24376568, gi:24376580, gi:24376598,
gi:24376611, gi:24376622, gi:24376632, gi:24376644, gi:24376656,
gi:24376666, gi:24376676, gi:24376687, gi:24376701, gi:24376713,
gi:24376731, gi:24376740, gi:24376746, gi:24376759, gi:24376773,
gi:24376784, gi:24376798, gi:24376811, gi:24376821, gi:24376833,
gi:24376843, gi:24376851, gi:24376864, gi:24376874, gi:24376885,
gi:24376898, gi:24376914, gi:24376926, gi:24376935, gi:24376948,
gi:24376961, gi:24376975, gi:24376986, gi:24376994, gi:24377004,
gi:24377016, gi:24377027, gi:24377040, gi:24377051, gi:24377061,
gi:24377070, gi:24377086, gi:24377096, gi:24377109, gi:24377120,
gi:24377134, gi:24377145, gi:24377156, gi:24377170, gi:24377181,
gi:24377193, gi:24377198, gi:24377208, gi:24377217, gi:24377232,
gi:24377241, gi:24377253, gi:24377262, gi:24377272, gi:24377283,
gi:24377293, gi:24377304, gi:24377317, gi:24377330, gi:24377345,
gi:24377356, gi:24377366, gi:24377378, gi:24377385, gi:24377400,
gi:24377414, gi:24377428, gi:24377442, gi:24377455, gi:24377469,
gi:24377479, gi:24377490, gi:24377501, gi:24377514, gi:24377528,
gi:24377540, gi:24377549, gi:24377559, gi:24377569, gi:24377578,

gi:24377588, gi:24377599, gi:24377613, gi:24377623, gi:24377640,
gi:24377653, gi:24377664, gi:24377678, gi:24377690, gi:24377702,
gi:24377714, gi:24377718, gi:24377725, gi:24377743, gi:24377757,
gi:24377767, gi:24377778, gi:24377791, gi:24377802, gi:24377812,
gi:24377821, gi:24377837, gi:24377847, gi:24377858, gi:24377867,
gi:24377876, gi:24377883, gi:24377895, gi:24377909, gi:24377922,
gi:24377933, gi:24377942, gi:24377950, gi:24377959, gi:24377971,
gi:24377983, gi:24377998, gi:24378012, gi:24378027, gi:24378041,
gi:24378055, gi:24378065, gi:24378076, gi:24378090, gi:24378102,
gi:24378115, gi:24378119, gi:24378133, gi:24378147, gi:24378154,
gi:24378171, gi:24378183, gi:24378195, gi:24378206, gi:24378217,
gi:24378228, gi:24378240, gi:24378249, gi:24378266, gi:24378285,
gi:24378298, gi:24378301, gi:24378319, gi:24378334, gi:24378349,
gi:24378356, gi:24378366, gi:24378387, gi:243783S, gi:24378403,
gi:24378416, gi:24378424, gi:24378434, gi:24378449, gi:24378461,
gi:24378472, gi:24378487, gi:24378500, gi:24378511, gi:24378520.

REFERENCE: 1 (bases 1 to 2030921)
AUTHOR (AU) : Ajdic,D.; McShan,W.M.; McLaughlin,R.E.; Savic,G.;
Chang,J.; Carson,M.B.; Primeaux,C.; Tian,R.; Kenton,S.;
Jia,H.; Lin,S.; Qian,Y.; Li,S.; Zhu,H.; Najar,F.;
Lai,H.; White,J.; Roe,B.A.; Ferretti,J.J.
TITLE (TI) : Genome sequence of Streptococcus mutans UA159, a
cariogenic dental pathogen
JOURNAL (SO) : Proc. Natl. Acad. Sci. U.S.A., 99 (22), 14434-14439
(2002)
OTHER SOURCE (OS) : CA 137:364100

REFERENCE: 2 (bases 1 to 2030921)
AUTHOR (AU) : Ajdic,D.; McShan,W.M.; McLaughlin,R.E.; Savic,G.;
Chang,J.; Carson,M.B.; Primeaux,C.; Tian,R.; Kenton,S.;
Jia,H.; Lin,S.; Qian,Y.; Li,S.; Zhu,H.; Najar,F.;
Lai,H.; White,J.; Roe,B.A.; Ferretti,J.J.
TITLE (TI) : Direct Submission
JOURNAL (SO) : Submitted (09-JUL-2002) Department of Microbiology and
Immunology, University of Oklahoma Health Sciences
Center, 940 SL Young Blvd., Oklahoma City, OK 73104,
USA



FEATURES (FEAT) : 
Feature Key 



1..2030921 



Qualifier 

/organism="Streptococcus mutans UA159" 

/mol-type="genomic DNA" 

/strain="UA159" 

/db-xref="taxon:210007" 



L5 
GENBANK® COPYRIGHT 2008 on STN 



LOCUS (LOC) : BX571856 GenBank (R)
GenBank ACC. NO. (GBN) : BX571856
GenBank VERSION (VER) : BX571856.1 GI:49240382
CAS REGISTRY NO. (RN) : 726687-34-9
SEQUENCE LENGTH (SQL) : 2902619
MOLECULE TYPE (CI) : DNA; circular
DIVISION CODE (CI) : Bacteria
DATE (DATE) : 7 Oct 2006
DEFINITION (DEF) : Staphylococcus aureus subsp. aureus strain MRSA252,
complete genome.
KEYWORDS (ST) : complete genome
SOURCE : Staphylococcus aureus subsp. aureus MRSA252
ORGANISM (ORGN) : Staphylococcus aureus subsp. aureus MRSA252
Bacteria; Firmicutes; Bacillales; Staphylococcus

REFERENCE : 1 (bases 1 to 2902619)
AUTHOR (AU) : Holden,M.T.; Feil,E.J.; Lindsay,J.A.; Peacock,S.J.;
Day,N.P.; Enright,M.C.; Foster,T.J.; Moore,C.E.;
Hurst,L.; Atkin,R.; Barron,A.; Bason,N.; Bentley,S.D.;
Chillingworth,C.; Chillingworth,T.; Churcher,C.;
Clark,L.; Corton,C.; Cronin,A.; Doggett,J.; Dowd,L.;
Feltwell,T.; Hance,Z.; Harris,B.; Hauser,H.;
Holroyd,S.; Jagels,K.; James,K.D.; Lennard,N.; Line,A.;
Mayes,R.; Moule,S.; Mungall,K.; Ormond,D.; Quail,M.A.;
Rabbinowitsch,E.; Rutherford,K.; Sanders,M.; Sharp,S.;
Simmonds,M.; Stevens,K.; Whitehead,S.; Barrell,B.G.;
Spratt,B.G.; Parkhill,J.
TITLE (TI) : Complete genomes of two clinical Staphylococcus aureus
strains: evidence for the rapid evolution of virulence
and drug resistance
JOURNAL (SO) : Proc. Natl. Acad. Sci. U.S.A., 101 (26), 9786-9791
(2004)
OTHER SOURCE (OS) : CA 141:152000

REFERENCE : 2 (bases 1 to 2902619)
AUTHOR (AU) : Holden,M.T.G.
TITLE (TI) : Direct Submission
JOURNAL (SO) : Submitted (23-JUN-2004) Submitted on behalf of the
Pathogen Sequencing Unit, Sanger Institute, Wellcome
Trust Genome Campus, Hinxton, Cambridge CB10 1SA,
E-mail: mh3@sanger.ac.uk

FEATURES (FEAT) :
Feature Key

1..2902619

Qualifier

/organism="Staphylococcus aureus subsp. aureus MRSA252"
/mol-type="genomic DNA"
/strain="MRSA252"
/sub-species="aureus"
/db-xref="taxon:282458"



L5 ANSWER 20 OF 27 USPATFULL on STN
AN 2003:71519 USPATFULL
TI Corynebacterium glutamicum genes encoding metabolic pathway proteins
IN Pompejus, Markus, Freinsheim, GERMANY, FEDERAL REPUBLIC OF
Kroger, Burkhard, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF
Schroder, Hartwig, Nussloch, GERMANY, FEDERAL REPUBLIC OF
Zelder, Oskar, Speyer, GERMANY, FEDERAL REPUBLIC OF
Haberhauer, Gregor, Limburgerhof, GERMANY, FEDERAL REPUBLIC OF
Kim, Jun-Won, Seoul, KOREA, REPUBLIC OF
Lee, Heung-Shick, Seoul, KOREA, REPUBLIC OF
Hwang, Byung-Joon, Seoul, KOREA, REPUBLIC OF
PI US 20030049804 A1 20030313
AI US 2000-746660 A1 20001222 (9)
RLI Continuation-in-part of Ser. No. US 2000-606740, filed on 23 Jun 2000,
PENDING Continuation-in-part of Ser. No. US 2000-603124, filed on 23 Jun
2000, PENDING
PRAI DE 1999-19931420 19990708
US 1999-141031P 19990625 (60)
US 1999-142101P 19990702 (60)
US 1999-148613P 19990812 (60)
US 2000-187970P 20000309 (60)
DT Utility
FS APPLICATION
LN.CNT 15004
INCL INCLM: 435/115.000
INCLS: 435/069.100; 435/252.300; 435/320.100; 435/183.000; 536/023.200
NCL NCLM: 435/115.000
NCLS: 435/069.100; 435/183.000; 435/252.300; 435/320.100; 536/023.200
IC [7]
ICM C12P013-08
ICS C07H021-04; C12N009-00; C12P021-02; C12N001-21
IPCI C12P0013-08 [ICM,7]; C12P0013-00 [ICM,7,C*]; C07H0021-04 [ICS,7];
C07H0021-00 [ICS,7,C*]; C12N0009-00 [ICS,7]; C12P0021-02 [ICS,7];
C12N0001-21 [ICS,7]
IPCR C07K0014-195 [I,C*]; C07K0014-34 [I,A]; C12N0001-21 [I,C*];
C12N0001-21 [I,A]; C12N0009-00 [I,C*]; C12N0009-00 [I,A];
C12N0009-18 [I,C*]; C12N0009-18 [I,A]; C12N0009-90 [I,C*];
C12N0009-90 [I,A]
CAS INDEXING IS AVAILABLE FOR THIS PATENT.



L5 ANSWER 21 OF 27 USPATFULL on STN 
AN 2003:95966 USPATFULL 

TI Polynucleotides, materials incorporating them, and methods for using 

them 

IN Glenn, Matthew, Auckland, NEW ZEALAND 

Havukkala, Ilkka J., Auckland, NEW ZEALAND 

Bloksberg, Leonard N., Auckland, NEW ZEALAND 

Lubbers, Mark W., Palmerston North, NEW ZEALAND 

Dekker, James, Palmerston North, NEW ZEALAND 

Christensson, Anna C, Lund, SWEDEN 

Holland, Ross, Palmerston North, NEW ZEALAND 

O'Toole, Paul W., Palmerston North, NEW ZEALAND 

Reid, Julian R., Palmerston North, NEW ZEALAND 

Coolbear, Timothy, Palmerston North, NEW ZEALAND 
PA Genesis Research & Development Corp. Ltd, Parnell, NEW ZEALAND (non-U.S. 
corporation) 

ViaLactia Bioscience (NZ) Ltd., Auckland, NEW ZEALAND (non-U.S. 
corporation) 
PI US 6544772 B1 20030408 

AI US 2000-634238 20000808 (9) 

DT Utility 
FS GRANTED 
LN.CNT 2015 

INCL INCLM: 435/252.300 

INCLS: 435/320.100; 536/023.700 
NCL NCLM: 435/252.300 

NCLS: 435/320.100; 536/023.700 
IC [7] 

ICM C12N001-21 

ICS C12N015-63; C12N015-31 

IPCI C12N0001-21 [ICM,7]; C12N0015-63 [ICS,7]; C12N0015-31 [ICS,7]
IPCR A23C0019-00 [I,C*]; A23C0019-032 [I, A]; C07K0014-195 [I,C*]; 
C07K0014-335 [I, A]; C12N0001-21 [I,C*]; C12N0001-21 [I, A]; 
C12N0015-31 [I,C*]; C12N0015-31 [I, A] 
EXF 435/252.3; 435/320.1; 536/23.7 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



=> d l5 22

L5 ANSWER 22 OF 27 BIOTECHDS COPYRIGHT 2008 THOMSON REUTERS on STN

AN 1999-00562 BIOTECHDS 

TI Use of lactic and propionic acid bacteria; 

to bind mycotoxin to prevent their absorption or to purify 

contaminated food or feedstuff 

AU El-Nezami H; Kankaanpaa P; Salminen S; Ahokas J 

PA Roy.Melbourne-Inst.Technol.

LO Melbourne, Victoria, Australia. 

PI WO 9834503 13 Aug 1998 

AI WO 1998-AU63 6 Feb 1998 

PRAI AU 1997-5005 7 Feb 1997 

DT Patent 

LA English 

OS WPI: 1998-557001 [49] 



=> d hist 

(FILE 'HOME' ENTERED AT 13:55:21 ON 07 JUL 2008) 

INDEX 'ADISCTI, ADISINSIGHT, ADISNEWS, AGRICOLA, ANABSTR, ANTE, AQUALINE, 
AQUASCI, BIOENG, BIOSIS, BIOTECHABS, BIOTECHDS, BIOTECHNO, CABA, CAPLUS, 
CEABA-VTB, CIN, CONFSCI, CROPB, CROPU, DDFB, DDFU, DGENE, DISSABS, DRUGB, 
DRUGMONOG2, DRUGU, EMBAL, EMBASE, ...' ENTERED AT 13:55:47 ON 07 JUL 2008
SEA LACTOBACILLUS REUTERI AND TOXIN (P) BIND? AND CD4+ CELL? AND 



0* FILE ADISCTI 
SEA LACTOBACILLUS REUTERI AND TOXIN (P) BIND? AND CD4+ CELLS 



0* FILE ADISCTI 
SEA LACTOBACILLUS REUTERI AND TOXIN (P) BIND? 



0* FILE ADISNEWS
0* FILE ANTE
0* FILE AQUALINE
0* FILE BIOENG
1* FILE BIOTECHABS
1* FILE BIOTECHDS
0* FILE BIOTECHNO
2 FILE CAPLUS
0* FILE CEABA-VTB
0* FILE CIN
0* FILE ESBIOBASE
0* FILE FOMAD
0* FILE FOREGE
2* FILE FROSTI
0* FILE FSTA
5 FILE GENBANK
1 FILE IFIPAT
0* FILE KOSMET
0* FILE NTIS
0* FILE NUTRACEUT
1* FILE PASCAL
0* FILE PHARMAML
1 FILE SCISEARCH
1 FILE TOXCENTER
21 FILE USPATFULL
3 FILE USPAT2
0* FILE WATER
2 FILE WPIDS
2 FILE WPINDEX



L1 QUE LACTOBACILLUS REUTERI AND TOXIN (P) BIND?



FILE 'BIOTECHDS, CAPLUS, FROSTI, GENBANK, IFIPAT, PASCAL, SCISEARCH,
TOXCENTER, USPATFULL, USPAT2' ENTERED AT 13:59:37 ON 07 JUL 2008

L2 38 S L1

L3 34 DUP REM L2 (4 DUPLICATES REMOVED)

L4 27 S L3 AND (TABLET OR FOOD OR DIETARY SUPLEMENT OR CONFECTIONERY

L5 27 DUP REM L4 (0 DUPLICATES REMOVED)

=> logoff 

ALL L# QUERIES AND ANSWER SETS ARE DELETED AT LOGOFF 
LOGOFF? (Y)/N/HOLD:y 

COST IN U.S. DOLLARS                    SINCE FILE      TOTAL
                                             ENTRY    SESSION
FULL ESTIMATED COST                         174.83     178.94

STN INTERNATIONAL LOGOFF AT 14:06:30 ON 07 JUL 2008 



Connecting via Winsock to STN 



Welcome to STN International! 
LOGINID:sssptl89dxw 



PASSWORD: 

TERMINAL (ENTER 1, 



********** Welcome to STN International ********** 

NEWS 1 Web Page for STN Seminar Schedule - N. America 

NEWS 2 NOV 21 CAS patent coverage to include exemplified prophetic 
substances identified in English-, French-, German-, 
and Japanese-language basic patents from 2004-present 
NEWS 3 NOV 26 MARPAT enhanced with FSORT command 
NEWS 4 NOV 26 CHEMSAFE now available on STN Easy 
NEWS 5 NOV 26 Two new SET commands increase convenience of STN 
searching 

NEWS 6 DEC 01 ChemPort single article sales feature unavailable 
NEWS 7 DEC 12 GBFULL now offers single source for full-text 

coverage of complete UK patent families 
NEWS 8 DEC 17 Fifty-one pharmaceutical ingredients added to PS 
NEWS 9 JAN 06 The retention policy for unread STNmail messages
will change in 2009 for STN-Columbus and STN-Tokyo
NEWS 10 JAN 07 WPIDS, WPINDEX, and WPIX enhanced Japanese Patent
Classification Data
NEWS 11 FEB 02 Simultaneous left and right truncation (SLART) added
for CERAB, COMPUAB, ELCOM, and SOLIDSTATE
NEWS 12 FEB 02 GENBANK enhanced with SET PLURALS and SET SPELLING
NEWS 13 FEB 06 Patent sequence location (PSL) data added to USGENE
NEWS 14 FEB 10 COMPENDEX reloaded and enhanced
NEWS 15 FEB 11 WTEXTILES reloaded and enhanced
NEWS 16 FEB 19 New patent-examiner citations in 300,000 CA/CAplus
patent records provide insights into related prior
art
NEWS 17 FEB 19 Increase the precision of your patent queries - use
terms from the IPC Thesaurus, Version 2009.01

NEWS 18 FEB 23 Several formats for image display and print options 

discontinued in USPATFULL and USPAT2 
NEWS 19 FEB 23 MEDLINE now offers more precise author group fields 

and 2009 MeSH terms 
NEWS 20 FEB 23 TOXCENTER updates mirror those of MEDLINE - more 

precise author group fields and 2009 MeSH terms 
NEWS 21 FEB 23 Three million new patent records blast AEROSPACE into 

STN patent clusters 
NEWS 22 FEB 25 USGENE enhanced with patent family and legal status 

display data from INPADOCDB 
NEWS 23 MAR 06 INPADOCDB and INPAFAMDB enhanced with new display 

formats 

NEWS 24 MAR 11 EPFULL backfile enhanced with additional full-text 
applications and grants 

NEWS 25 MAR 11 ESBIOBASE reloaded and enhanced 

NEWS 26 MAR 20 CAS databases on STN enhanced with new super role 

for nanomaterial substances 
NEWS 27 MAR 23 CA/CAplus enhanced with more than 250,000 patent 

equivalents from China 

NEWS EXPRESS JUNE 27 08 CURRENT WINDOWS VERSION IS V8.3, 

AND CURRENT DISCOVER FILE IS DATED 23 JUNE 2008. 

NEWS HOURS STN Operating Hours Plus Help Desk Availability 
NEWS LOGIN Welcome Banner and News Items 

NEWS IPCS For general information regarding STN implementation of IPC 8 

Enter NEWS followed by the item number or name to see news on that 
specific topic. 

All use of STN is subject to the provisions of the STN customer 
agreement. This agreement limits use to scientific research. Use 
for software development or design, implementation of commercial 
gateways, or use of CAS and STN data in the building of commercial 
products is prohibited and may result in loss of user privileges 
and other penalties. 

************** STN Columbus *************** 
FILE 'HOME' ENTERED AT 17:02:18 ON 29 MAR 2009 



=> index bioscience 

FILE 'DRUGMONOG' ACCESS NOT AUTHORIZED 
COST IN U.S. DOLLARS                    SINCE FILE      TOTAL
                                             ENTRY    SESSION
FULL ESTIMATED COST                           0.22       0.22



INDEX 'ADISCTI, ADISINSIGHT, ADISNEWS, AGRICOLA, ANABSTR, ANTE, AQUALINE, 

AQUASCI, BIOENG, BIOSIS, BIOTECHABS, BIOTECHDS, BIOTECHNO, CABA, CAPLUS, 
CEABA-VTB, CIN, CONFSCI, CROPB, CROPU, DDFB, DDFU, DGENE, DISSABS, DRUGB, 
DRUGMONOG2, DRUGU, EMBAL, EMBASE, ...' ENTERED AT 17:02:31 ON 29 MAR 2009

68 FILES IN THE FILE LIST IN STNINDEX 



Enter SET DETAIL ON to see search term postings or to view 
search error messages that display as 0* with SET DETAIL OFF. 

=> s Lactobacillus reuteri and toxi? and CD+4 (p) lymphocyte? and cultur? 
0* FILE ADISNEWS
0* FILE ANTE
0* FILE AQUALINE
0* FILE BIOENG
0* FILE BIOTECHABS
0* FILE BIOTECHDS
0* FILE BIOTECHNO
0* FILE CEABA-VTB
0* FILE CIN
21 FILES SEARCHED...
0* FILE FOMAD
0* FILE FOREGE
0* FILE FROSTI
0* FILE FSTA
0* FILE KOSMET
43 FILES SEARCHED...
0* FILE NTIS
0* FILE NUTRACEUT
0* FILE PASCAL
0* FILE PHARMAML
61 FILES SEARCHED...
0* FILE WATER

0 FILES HAVE ONE OR MORE ANSWERS, 68 FILES SEARCHED IN STNINDEX

L1 QUE LACTOBACILLUS REUTERI AND TOXI? AND CD+4(P) LYMPHOCYTE? AND CULTUR?

=> logoff 

ALL L# QUERIES AND ANSWER SETS ARE DELETED AT LOGOFF 
LOGOFF? (Y)/N/HOLD:y 

COST IN U.S. DOLLARS                    SINCE FILE      TOTAL
                                             ENTRY    SESSION
FULL ESTIMATED COST                           2.04       2.26



STN INTERNATIONAL LOGOFF AT 17:04:21 ON 29 MAR 2009 



